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METHOD FOR PREPARING POLYFUNCTIONALIZED PEPTIDES 
AND/OR PROTEINS VIA NATIVE CHEMICAL LIGATION 

Priority Claim 

5 [0001] This application claims priority to provisional application Nos.: 

60/500,708, filed September 5, 2003; 60/560,147, filed April 7, 2004; each of the 
above-cited Applications is hereby incorporated by reference in its entirety. 


Government Support 
10 [0002] The invention was supported in part by Grant Nos.: CA1 03823 

(formerly All 6943), T32-CA62948 and AI051883 from the National Institutes of 
Health. The U.S. government may have certain rights in this invention. 


Background of the Invention 

15 [0003] Glycoproteins are important biomacromolecules that are 

biosynthesized through posttranslational glycosylation of newly fashioned proteins 
emerging from the ribosome. Interest in glycoproteins arises at many levels. A 
long-term goal of the growing field of chemistry-based glycobiology is the 
delineation of the consequences of glycosylation on critical properties such as 

20 protein folding, proteolytic stability, and cell adhesion. 1 Such insights could explain 
why nature bothers to glycosylate otherwise functional proteins. Moreover, 
glycoproteins have potentially important clinical roles in the context of vaccines, 
diagnostics, and therapeutics. Indeed., erythropoietin, albeit a heterogeneous 
glycoprotein, 5 is clinically valuable as a treatment for anemia, among other 

25 indications. 48 

[0004] Many naturally occurring, medicinally important glycoproteins (cf, 

for example, erythropoietin 48 and gpl20 n ) display multiple glycosylation sites 
containing large oligosaccharide domains. However, given the complexity and 
variability of biological glycosylation pathways, 10 the isolation of homogeneous 
30 glycoproteins from natural sources in significant quantity is extremely difficult. 

[0005] Numerous methods exist for the production of glycopeptides by 

chemical synthesis. For example, glycans have been introduced into peptides via 
amino acid "cassettes" with pendant protected saccharides, 26 " 31 through enzymatic 
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manipulations of glycopeptides, 32 " 38 or by conjugation of fully elaborated, complex 
saccharides to short synthetic peptides. 39 " 41 Larger 0-linked glycopeptides have been 
synthesized using ligation techniques 42,43 such as expressed protein ligation. 44 " 46 
Bertozzi and coworkers extended the scope of the "cassette" approach by applying 

5 native chemical ligation to the synthesis of a biologically active glycoprotein with 
two single-residue O-linked glycans. 47 Tolbert and Wong described the ligation of a 
392-residue intein-generated peptide thioester and a dipeptide functionalized with a 
single iV-acetylglucosamine residue. However, none of these approaches has 
allowed the assembly of complex glypeptides or glycoproteins multiply 

10 functionalized (e.g. , multiply glycosylated) at designated sites. 

[0006] Accordingly, there remains a need for novel synthetic methods 

leading to the preparation of homogeneous polyfunctionalized peptides and/or 
proteins. Specifically, convergent, stereoselective, versatile methods for preparing 
such glycopeptides and/or glycoproteins are needed. 


peptides and proteins, the present invention, in one aspect, provides a system for 
preparing a polyfunctionalized peptide comprising a peptidic backbone made up of 
20 four or more amino acids, wherein two or more non-adjacent amino acids are 
independently substituted with a moiety having the structure: 


with the proviso that the peptide sequence between any two consecutive, 
non-adjacent, amino acids bearing a A-L 1 - moiety comprises at least one cysteine 
25 residue; 

wherein the method comprises a step of: 

reacting a peptide acyl donor comprising a peptidic backbone made up of 
two or more amino acids wherein said peptide acyl donor has the structure: 


15 


Summary of the Invention 


[0007] 


In recognition of the need to provide access to polyfunctionalized 
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a Peptide Backbone 

R X1 HN 1 



.OR X0 


O 

with a peptide amine acceptor having the structure: 


(Ski) 



1 Peptide Backbone 


Y 

O 


under suitable conditions to effect ligation; 
5 wherein kl and k2 are independently integers between 1 and about 20; 

each occurrence of A, Ai and A 2 is independently an aliphatic, 
heteroaliphatic, aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutically 
useful group or entity; 

R S1 is a sulfide protecting group; 
10 R xo is a group such that the moiety -C(=O)OR X0 can be made to undergo 

ligation with the peptide amine acceptor; 

each occurrence of L 1 is independently a substituted or unsubstituted, linear 
or branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic 
moiety; 

15 R X1 is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 

alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

R* 2 is -OR* 28 or -NR X2b R X2c , wherein R* 23 is hydrogen, alkyl, aromatic, 
• heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 
20 protecting group, an amino acid or a proctected amino acid; and and R c are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid. 
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[0008] In certain embodiments, each occurrence of A is independently a 

pharmaceutically useful group or entity. In certain embodiments, each occurrence of 
A is independently a biomolecule, a small molecule, a macromolecule or a 
diagnostic label. 

5 [0009] In certain exemplary embodiments, each occurrence of A is 

independently a carbohydrate determinant having the structure: 

-*z 

wherein a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
the proviso that the x, y and z bracketed structures represent furanose or pyranose 

10 moieties and the sum of b and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 
g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
wherein Ro is hydrogen, a linear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R2, R3, Rt, R5, R* , R7, Rs and R9 is 
independently hydrogen, OH, OR', NHR 1 , NHCOR', F, CH 2 OH, CHzOR 5 , a 

15 substituted or unsubstituted , linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R 1 is independently hydrogen, CHO, COOR", or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 
saccharide moiety having the structure: 


20 



wherein Y and Z are independently NH or O; wherein k, 1, r, s, t, u, v and w 
are each independently 0, 1 or 2; with the proviso that the v and w bracketed 
structures represent furanose or pyranose moieties and the sum of 1 and k is 1 or 2, 
and the sum of s and u is 1 or 2, and with the proviso that v and w are not 



Page 4 of 74 


WO 2005/044841 


PCT/US2004/029047 


simultaneously 0; wherein R' 0 is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rio, Rn, R12, R13, R14 and R i5 is 
independently hydrogen, OH, OR iH , NHR 1 ", NHCOR'", F, CH 2 OH, CEbOR 1 ", or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 

5 tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of Ri 6 is hydrogen, COOH, COOR n , CONHR 1 ', a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occurrence 
of R Ui is hydrogen, CHO, COOR iv , or a substituted or unsubstituted linear or 
branched chain alkyl, acyl, arylalkyl or aryl. group; and wherein each occurrence of 

10 R" and R iv are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group. 

[0010] In certain embodiments, the present invention provides a method for 

preparing a polyfunctionalized peptide having the structure: 



15 wherein si and s2 are independently an integer from 1 to about 20; 

tl, t2 and t3 are each independently an integer; 

R X1 is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; 

20 R X2 is -OR* 23 or -NR^R* 20 , wherein R* 23 is hydrogen, alkyl, aromatic, 

heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 
protecting group, an amino acid or a proctected amino acid; and R* 215 and R* 20 are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 

25 proctected amino acid; 
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R P1 , R* 2 and R P3 are independently H, alkyl, heteroalkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain; 

each occurrence of L 1 is independently a substituted or unsubstituted 
5 aliphatic or heteroaliphatic moiety; 

Ai and A 2 are each independently an aliphatic, heteroaliphatic, aromatic, 
heteroaromatic, aryl, heteroaryl or a pharmaceutical^ useful group or entity; and 

at least one occurrence of the bracketed structure t2 is a cysteine residue or 
protected cysteine residue; 
10 wherein the method comprise a step of: 

reacting a peptide acyl donor having the structure: 



with a peptide amine acceptor having trie structure: 



15 under suitable conditions to effect ligation; 

wherein the sum t+t' equals (t2)+l ; R S1 is a sulfide protecting group; and R x0 
is a group such that the moiety -C(=O)OR x0 can be made to undergo ligation with 
the glycopeptide amine acceptor. 

[0011] In certain exemplary embodiments, Ai and A 2 are each independently 

20 a carbohydrate domain as defined for A above. 
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[0012] In certain embodiments, the step may be carried out once, or repeated 

a desired number of times, to prepare the polyfunctionalized peptide having the 


0 


R P0 



5 wherein R X1 and R* 2 are as defined above; 

each occurrence of A may be the same or different and may be as defined for 
Ai and A 2 above; 

each occurrence of R P1 may be the same or different and may be as defined 
for R pl and R K above; 
10 q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer from 1 to about 20; 
each occurrence oft is independently an integer; 
tO is an integer; and 

each occurrence of R po is independently H, alkyl, heteroalkyl, aromatic, 
15 heteroaromatic, aryl, heteroaryl -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain. 

Definitions 

[0013] Certain specific functional groups defined in the inventive method are 

20 described in more detail below. For purposes of this invention, the chemical 
elements are identified in accordance with the Periodic Table of the Elements, CAS 
version, Handbook of Chemistry and Physics, 75 th Ed., inside cover, and specific 
functional groups are defined as described therein. Additionally, general principles 
of organic chemistry, as well as specific functional moieties and reactivity, are 
25 described in "Organic Chemistry", Thomas Sorrell, University Science Books, 
Sausalito: 1999, the entire contents of which are incorporated herein by reference. 
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[0014] It will be appreciated that the compounds, as described herein, may 

be substituted with any number of substituents or functional moieties. In general, 
the term "substituted" whether preceded by the term "optionally" or not, and 
substituents contained in formulas of this invention, refer to the replacement of 
hydrogen radicals in a given structure with, the radical of a specified substituent. 
When more than one position in any given structure may be substituted with more 
than one substituent selected from a specified group, the substituent may be either 
the same or different at every position. As used herein, the term "substituted" is 
contemplated to include all permissible substituents of organic compounds. In a 
broad aspect, the permissible substituents include acyclic and cyclic, branched and 
unbranched, carbocyclic and heterocyclic, aromatic and non-aromatic, carbon and 
heteroatom substituents of organic compounds. For purposes of this invention, 
heteroatoms such as nitrogen may have hydrogen substituents and/or any 
permissible substituents of organic compounds described herein which satisfy the 
valencies of the heteroatoms. Furthermore, this invention is not intended to be 
limited in any manner by the permissible substituents of organic compounds. 
Combinations of substituents and variables envisioned by this invention are 
preferably those that result in the formation of stable compounds useful in the 
treatment and prevention, for example of disorders, as described generally above. 
Examples of substituents include, but are not limited to aliphatic; heteroaliphatic; 
alicyclic; heteroalicyclic; aromatic, heteroaromatic; aryl; heteroaryl; alkylaryl; 
alkylheteroaryl; alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; 
heteroalkylthio; heteroarylthio; F; CI; Br; I; -N0 2 ; -CN; -CF 3 ; -CH 2 CF 3 ; -CHC1 2 ; - 
CH 2 OH; -CH 2 CH 2 OH; -CH 2 NH 2 ; -CH 2 S0 2 CH 3 ; - or — GR G1 wherein G is -O-, -S-, - 
NR G2 -, -C(=0)-, -S(=0)-, -S0 2 -, -C(=O)0-, -C(=0)NR G \ -OC(=0>, -NR^C^O)-, 
-OC(=0)0-, -OC(=0)NR G2 -, -NR G2 C(=0)0-, -NR G2 C(=0)NR G2 -, -C(=S>, - 
C (=S)S-, -SC(=S)-, -SC(=S)S-, -C(=NR G2 >, -C(=NR G2 )0-, -C(=NR G2 )NR G3 -, - 
OC(=NR G2 )-, -NR G2 C(=NR G3 )-, -NR G2 S0 2 -, -NR G2 S0 2 NR G3 -, or -S0 2 NR G2 -, 
wherein each occurrence of R G1 , R 02 and R G3 independently includes, but is not 
) limited to, hydrogen, halogen, or an optionally substituted aliphatic, heteroaliphatic, 
alicyclic, heteroalicyclic, aromatic, heteroaromatic, aryl, heteroaryl, alkylaryl, or 
alkylheteroaryl moiety. Additional examples of generally applicable substituents are 
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illustrated by the specific embodiments shown in the Examples that are described 
herein. 

[0015] The term "stable", as used herein, preferably refers to compounds 

which possess stability sufficient to allow manufacture and which maintain the 
integrity of the compound for a sufficient period of time to be useful for the 
purposes detailed herein. 

[0016] The term "aliphatic", as used herein, includes both saturated and 

unsaturated, straight chain (i.e., unbranched) or branched aliphatic hydrocarbons, 
which are optionally substituted with one or more functional groups. As will be 
appreciated by one of ordinary skill in the art, "aliphatic" is intended herein to 
include, but is not limited to, alkyl, alkenyl, alkynyl moieties. Thus, as used herein, 
the term "alkyl" includes straight and branched alkyl groups. An analogous 
convention applies to other generic terms such as "alkenyl", "alkynyl" and the like. 
Furthermore, as used herein, the terms "alkyl", "alkenyl", "alkynyl" and the like 
encompass both substituted and unsubstituted groups. In certain embodiments, as 
used herein, "lower alkyl" is used to indicate those alkyl groups (substituted, 
unsubstituted, branched or unbranched) having about 1-6 carbon atoms. 
[0017] In certain embodiments, the alkyl, alkenyl and alkynyl groups 

employed in the invention contain about 1-20 aliphatic carbon atoms. In certain 
other embodiments, the alkyl, alkenyl, and alkynyl groups employed in the invention 
contain about 1-10 aliphatic carbon atoms. In yet other embodiments, the alkyl, 
alkenyl, and alkynyl groups employed in the invention contain about 1-8 aliphatic 
carbon atoms. In still other embodiments, the alkyl, alkenyl, and alkynyl groups 
employed in the invention contain about 1-6 aliphatic carbon atoms. In yet other 
i embodiments, the alkyl, alkenyl, and alkynyl groups employed in the invention 
contain about 1-4 carbon atoms. Illustrative aliphatic groups thus include, but are 
not limited to, for example, methyl, ethyl, n-propyl, isopropyl, allyl, n-butyl, sec- 
butyl, isobutyl, tert-butyl, n-pentyl, sec-pentyl, isopentyl, tert-pentyl, n-hexyl, sec- 
hexyl, moieties and the like, which again, may bear one or more substituents. 
0 Alkenyl groups include, but are not limited to, for example, ethenyl, propenyl, 
butenyl, l-methyl-2-buten-l-yl, and the like. Representative alkynyl groups include, 
but are not limited to, ethynyl, 2-propynyl (propargyl), 1-propynyl and the like. 
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[0018] The term "alicyclic", as used herein, refers to compounds which 

combine the properties of aliphatic and cyclic compounds and include but are not 
limited to cyclic, or polycyclic aliphatic hydrocarbons and bridged cycloalkyl 
compounds, which are optionally substituted with one or more functional groups. 
5 As will be appreciated by one of ordinary skill in the art, "alicyclic" is intended 
herein to include, but is not limited to, cycloalkyl, cycloalkenyl, and cycloalkynyl 
moieties, which are optionally substituted with one or more functional groups. 
Illustrative alicyclic groups thus include, but are not limited to, for example, 
cyclopropyl, -CH 2 -cyclopropyl, cyclobutyl, -CH 2 -cyclobutyl, cyclopentyl, -CH 2 - 
10 cyclopentyl-n, cyclohexyl, -CH 2 -cyclohexyl, cyclohexenylethyl, cyclohexanylethyl, 
norborbyl moieties and the like, which again, may bear one or more substituents. 
[0019] The term "cycloalkyl", as used herein, refers specifically to groups 

having three to seven, preferably three to ten carbon atoms. Suitable cycloalkyls 
include, but are not limited to cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, 
15 cycloheptyl and the like, which, as in the case of aliphatic, heteroaliphatic or 
heterocyclic moieties, may optionally be substituted. An analogous convention 
applies to other generic terms such as "cycloalkenyl", "cycloalkynyl" and the like. 
[0020] The term "heteroaliphatic", as used herein, refers to aliphatic moieties 

in which one or more carbon atoms in the main chain have been substituted with a 
20 heteroatom. Thus, a heteroaliphatic group refers to an aliphatic chain which 
contains one or more oxygen, sulfur, nitrogen, phosphorus or silicon atoms, e.g., in 
place of carbon atoms. Heteroaliphatic moieties may be branched or linear 
unbranched. In certain embodiments, heteroaliphatic moieties are substituted by 
independent replacement of one or more of the hydrogen atoms thereon with one or 
25 more moieties including, but not limited to aliphatic; heteroaliphatic; alicyclic; 
heteroalicyclic; aromatic, heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; 
alkoxy; aryloxy; heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; 
heteroarylthio; F; CI; Br; I; -N0 2 ; -CN; -CF 3 ; -CH 2 CF 3 ; -CHC1 2 ; -CH 2 OH; - 
CH 2 CH 2 OH; -CH 2 NH 2 ; -CH 2 S0 2 CH 3 ; - or -GR G1 wherein G is -O-, -S-, -NR° 2 -, - 
30 c(=0)-, -S(=0)-, -S0 2 -, -C(=0)O-, -C(=0)NR G2 -, -OC(=0)-, -NR G2 C(=0)-, - 
0 C(=0)0-, -OC(=0)NR G2 -, -NR G2 C(=0)0-, -NR G2 C(=0)NR G2 -, -C(=S)-, - 
C (=S)S-, -SC(=S)-, -SC(=S)S-, -C(=NR G2 )-, -C(=NR G2 )0-, -C(=NR G2 )NR° 3 -, - 
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OC(=NR G2 )-, -NR G2 C(=NR G3 )-, -NR^SO,-, -NR G2 S0 2 NR G \ or -S0 2 NR G2 -, 
wherein each occurrence of R G1 , R 02 and R G3 independently includes, but is not 
limited to, hydrogen, halogen, or an optionally substituted aliphatic, heteroaliphatic, 
alicyclic, heteroalicyclic, aromatic, heteroaromatic, aryl, heteroaryl, alkylaryl, or 
5 alkylheteroaryl moiety. Additional examples of generally applicable substituents are 
illustrated by the specific embodiments shown in the Examples that are described 
herein. 

[0021] The term "heteroalicyclic", "heterocycloalkyl" or "heterocyclic", as 

used herein, refers to compounds which combine the properties of heteroaliphatic 
10 and cyclic compounds and include but are not limited to saturated and unsaturated 
mono- or polycyclic heterocycles such as morpholino, pyrrolidinyl, furanyl, 
thiofuranyl, pyrrolyl etc., which are optionally substituted with one or more 
functional groups, as defined herein. In. certain embodiments, the term 
"heterocyclic" refers to a non-aromatic 5-, 6- or 7- membered ring or a polycyclic 
15 group, including, but not limited to a bi- or tri-cyclic group comprising fused six- 
membered rings having between one and three heteroatoms independently selected 
from oxygen, sulfur and nitrogen, wherein (i) each 5-membered ring has 0 to 2 
double bonds and each 6-membered ring has O to 2 double bonds, (ii) the nitrogen 
and sulfur heteroatoms may optionally be oxidized, (iii) the nitrogen heteroatom 
20 may optionally be quaternized, and (iv) any of the above heterocyclic rings may be 
fused to an aryl or heteroaryl ring. Representative heterocycles include, but are not 
limited to, pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, 
piperidinyl, piperazinyl, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, 
isothiazolidinyl, and tetrahydrofuryl. In certain embodiments, a "substituted 
25 heterocycloalkyl or heterocycle" group is utilized and as used herein, refers to a 
heterocycloalkyl or heterocycle group, as defined above, substituted by the 
independent replacement of one, two or three of the hydrogen atoms thereon with 
but are not limited to aliphatic; heteroaliphatic; alicyclic; heteroalicyclic; aromatic, 
heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; 
30 heteroalkoxy; heteroaryloxy; alkylthio; arylthio; heteroalkylthio; heteroarylthio; F; 
CI; Br; I; -N0 2 ; -CN; -CF 3 ; -CH 2 CF 3 ; -CHC1 2 ; -CH 2 OH; -CH 2 CH 2 OH; -CH 2 NH 2 ; - 
CH 2 S0 2 CH 3 ; - or — GR G1 wherein G is -O-, -S-, -NR G2 -, -C(=Q)-, -S(=0>, -SQ 2 -, - 
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C(=O)0-, -C(=0)NR G2 -, -OC(=0)-, -NR G2 C(=0)-, -OC(=0)0-, -OC^CONR 02 -, - 
NR G2 C(=0)0-, -NR G2 C(=0)NR G2 - 5 -C(=S)-, -C(=S)S-, -SC(=S)-, -SC(=S)S-, - 
C(=NR G2 )-, -C(=NR G2 )0-, -C(=NR G2 )NR G3 -, -OC(=NR G2 )-, -NR G2 C(=NR G3 )-, - 
NR G2 S0 2 -, -NR G2 S0 2 NR G3 -, or -S0 2 NR G2 -, wherein each occurrence of R G1 , R 02 
5 and R G3 independently includes, but is not limited to, hydrogen, halogen, or an 
optionally substituted aliphatic, heteroaliphatic, alicyclic, heteroalicyclic, aromatic, 
heteroaromatic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl moiety. Additional 
examples or generally applicable substituents are illustrated by the specific 
embodiments shown in' the Examples, which are described herein. 
10 [0022] Additionally, it will be appreciated that any of the alicyclic or 

heteroalicyclic moieties described above and herein may comprise an aryl or 
heteroaryl moiety fused thereto. Additional examples of generally applicable 
substituents are illustrated by the specific embodiments shown in the Examples that 
are described herein. 

15 [0023] In general, the term "aromatic moiety", as used herein, refers to 

stable substituted or unsubstituted unsaturated mono- or polycyclic hydrocarbon 
moieties having preferably 3-14 carbon atoms, comprising at least one ring 
satisfying the Huckel rule for aromaticity. Examples of aromatic moieties include, 
but are not limited to, phenyl, indanyl, indenyl, naphthyl, phenanthryl and anthracyl. 

20 [0024] In general, the term "heteroaromatic moiety", as used herein, refers to 

stable substituted or unsubstituted unsaturated mono-heterocyclic or 
polyheterocyclic moieties having preferably 3-14 carbon atoms, comprising at least 
one ring satisfying the Huckel rule for aromaticity. Examples of heteroaromatic 
moieties include, but are not limited to, pyridyl, quinolinyl, dihydroquinolinyl, 

25 isoquinolinyl, quinazolinyl, dihydroquinazolyl, and tetrahydroquinazolyl. 

[0025] It will also be appreciated that aromatic and heteroaromatic moieties, 

as defined herein, may be attached via an aliphatic (e.g., alkyl) or heteroaliphatic 
(e.g., heteroalkyl) moiety and thus also include moieties such as - 
(aliphatic)aromatic, -(heteroaliphatic)aromatic, -(aliphatic)heteroaromatic, - 

30 (heteroaliphatic)heteroaromatic, -(alkyl)aromatic, -(heteroalkyl) aromatic, 

(alkyl)heteroaromatic, and -(heteroalkyl)heteroaromatic moieties. Thus, as used 
herein, the phrases "aromatic or heteroaromatic moieties" and "aromatic, 
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heteroaromatic, -(alkyl)aromatic, -(heteroalkyl)aromatic, 

(heteroalkyl)heteroaromatic, and -(heteroalkyl)heteroaromatic" are interchangeable. 
Substituents include, but are not limited to, any of the previously mentioned 
substituents, i.e., the substituents recited for aliphatic moieties, or for other moieties 
as disclosed herein, resulting in the formation of a stable compound. 
[0026] In general, the term "aryl" refers to aromatic moieties, as described 

above, excluding those attached via an aliphatic (e.g., alkyl) or heteroaliphatic (e.g., 
heteroalkyl) moiety. In certain embodiments of the present invention, "aryl" refers to 
a mono- or bicyclic carbocyclic ring system having one or two rings satisfying the 
Huckel rule for aromaticity, including, but not limited to, phenyl, naphthyl, 
tetrahydronaphthyl, indanyl, indenyl and the like. 

[0027] Similarly, the term "heteroaryl" refers to heteroaromatic moieties, as 

described above, excluding those attached via an aliphatic (e.g., alkyl) or 
heteroaliphatic (e.g., heteroalkyl) moiety. In certain embodiments of the present 
invention, the term "heteroaryl", as used herein, refers to a cyclic unsaturated radical 
having from about five to about ten ring atoms of which one ring atom is selected 
from S, O and N; zero, one or two ring atoms are additional heteroatoms 
independently selected from S, O and N; and the remaining ring atoms are carbon, 
the radical being joined to the rest of the molecule via any of the ring atoms, such as, 
, for example, pyridyl, pyrazinyl, pyrimidinyl, pyrrolyl, pyrazolyl, imidazolyl, 
thiazolyl, oxazolyl, isooxazolyl, thiadiazolyl, oxadiazolyl, thiophenyl, furanyl, 
quinolinyl, isoquinolinyl, and the like. 

[0028] Substituents for aryl and heteroaryl moieties include, but are not 

limited to, any of the previously mentioned substitutents, i.e., the substituents recited 

5 for aliphatic moieties, or for other moieties as disclosed herein, resulting in the 
formation of a stable compound. For example, aryl and heteroaryl groups (including 
bicyclic aryl groups) can be unsubstituted or substituted, wherein substitution 
includes replacement of one, two or three of the hydrogen atoms thereon 
independently with any one or more of the following moieties including, but not 

0 limited to: aliphatic; heteroaliphatic; alicyclic; heteroalicyclic; aromatic, 
heteroaromatic; aryl; heteroaryl; alkylaryl; alkylheteroaryl; alkoxy; aryloxy; 
heteroalkoxy; heteroaryloxy; alkylthio; arylthio; lieteroalkylthio; heteroarylthio; F; 
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CI; Br; I; -N0 2 ; -CN; -CF 3 ; -CH 2 CF 3 ; -CHC1 2 ; -CH 2 OH; -CH 2 CH 2 OH; -CH 2 NH 2 ; - 
CH 2 S0 2 CH 3 ; - or — GR G1 wherein G is -O-, -S-, -NR G2 -, -C(=0)-, -S(=0)-, -S0 2 -, - 
C(=0)0-, -C(=0)NR G2 -, -OC(=0)-, -NR G2 C(=0)-, -OC(=0)0-, -OC(=0)NR G2 -, - 
NR G2 C(=0)0-, -NR G2 C(=0)NR 02 -, -C(=S)-, -C(=S)S-, -SC(=S)-, -SC(=S)S-, - 

5 C(=NR G2 )-, -C(=NR G2 )0-, -C(=NR G2 )NR. G3 -, -OC(=NR G2 >, -NR G2 C(=NR G3 )-, - 
NR^SOz-, -NR G2 S0 2 NR G3 -, or -S0 2 NR G2 -, wherein each occurrence of R G1 , R 02 
and R G3 independently includes, but is not limited to, hydrogen, halogen, or an 
optionally substituted aliphatic, heteroaliphatic, alicyclic, heteroalicyclic, aromatic, 
heteroaromatic, aryl, heteroaryl, alkylaryl, or alkylheteroaryl moiety. Additional 

10 examples of generally applicable substituents are illustrated by the specific 
embodiments shown in the Examples that are described herein. 
[0029] The terms "alkoxy" (or "alkyloxy"), and "thioalkyl" as used herein 

refers to an alkyl group, as previously defined, attached to the parent molecular 
moiety through an oxygen atom ("alkoxy") or through a sulfur atom ("thioalkyl"). 

15 In certain embodiments, the alkyl group contains about 1-20 aliphatic carbon atoms. 
In certain other embodiments, the alkyl group contains about 1-10 aliphatic carbon 
atoms. In yet other embodiments, the alkyl group contains about 1-8 aliphatic 
carbon atoms. In still other embodiments, the alkyl group contains about 1-6 
aliphatic carbon atoms. In yet other embodiments, the alkyl group contains about 1- 

20 4 aliphatic carbon atoms. Examples of alkoxy groups, include but are not limited to, 
methoxy, ethoxy, propoxy, isopropoxy, n-butoxy, tert-butoxy, neopentoxy and n- 
hexoxy. Examples of thioalkyl groups include, but are not limited to, methylthio, 
ethylthio, propylthio, isopropylthio, n-butylthio, and the like. 

[0030] The term "amine" refers to a group having the structure -N(R) 2 

25 wherein each occurrence of R is independently hydrogen, or an aliphatic, 
heteroaliphatic, aromatic or heteroaromatic moiety, or the R groups, taken together, 
may form a heterocyclic moiety. 

[0031] The term "alkylamino" refers to a group having the structure - 

NHR'wherein R' is alkyl, as defined herein. The term "aminoalkyl" refers to a 
30 group having the structure NH 2 R'-, wherein R' is alkyl, as defined herein. In certain 
embodiments, the alkyl group contains about 1-20 aliphatic carbon atoms. In certain 
other embodiments, the alkyl group contains about 1-10 aliphatic carbon atoms. In 
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yet other embodiments, the alkyl, alkenyl, and alkynyl groups employed m the 
invention contain about 1-8 aliphatic carbon atoms. In still other embodiments, the 
alkyl group contains about 1-6 aliphatic carbon atoms. In yet other embodiments, 
the alkyl group contains about 1-4 aliphatic carbon atoms. Examples of alkylamino 
5 include, but are not limited to, methylamino, ethylamino, iso-propylamino and the 
like. 

[0032] The terms "halo" and "halogen" as used herein refer to an atom 

selected from fluorine, chlorine, bromine and iodine. 

[0033] The term "acyloxy", as used herein, does not substantially differ from 

10 the common meaning of this term in the art, and refers to a moiety of structure - 
OC(0)Rx, wherein R x is a substituted or unsubstituted aliphatic, alicychc, 
heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety. 

[0034] The term "acyl", as used herein, does not substantially differ from the 

common meaning of this term in the art, and refers to a moiety of structure - 
15 C(0)Rx, wherein Rx is a substituted or unsubstituted, aliphatic, alicychc, 
heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety. 

[0035] The term "imine", as used herein, does not substantially differ from 

the common meaning of this term in the art, and refers to a moiety of structure - 
C(=NRx)Ry, wherein R x is hydrogen or an optionally substituted aliphatic, alicyclic, 
20 heteroaliphatic, heteroalicyclic, aryl or heteroaryl moiety; and Ry is an optionally 
substituted aliphatic, alicyclic, heteroaliphatic, heteroalicyclic, aryl or heteroaryl 
moiety. 

[0036] As used herein, the terms "aliphatic", "heteroaliphatic", "alkyl", 

"alkenyl", "alkynyl", "heteroalkyl", "heteroalkenyl", "heteroalkynyl", and the like 

25 encompass substituted and unsubstituted, saturated and unsaturated, and linear and 
branched groups. Similarly, the terms "alicyclic", "heteroalicyclic", 
heterocycloalkyl", "heterocycle" and the like encompass substituted and 
unsubstituted, and saturated and unsaturated groups. Additionally, the terms 
"cycloalkyl", "cycloalkenyl", "cycloalkynyl", "heterocycloalkyl", 

30 "heterocycloalkenyl", "heterocycloalkynyl", "aryl", "heteroaryl" and the like 
encompass both substituted and unsubstituted groups. 
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[0037] It will be appreciated that additional examples of generally applicable 

substitutents are illustrated by the specific embodiments shown in the Examples 
which are described herein, but are not limited to these Examples. 
[0038] By the term "protecting group", has -used herein, it is meant that a 

5 particular functional moiety, e.g., O, S, or N, is temporarily blocked so that a 
reaction can be carried out selectively at another reactive site in a multifunctional 
compound. In preferred embodiments, a protecting group reacts selectively in good 
yield to give a protected substrate that is stable to the projected reactions; the 
protecting group must be selectively removed in good yield by readily available, 
0 preferably nontoxic reagents that do not attack the other functional groups; the 
protecting group forms an easily separable derivative (more preferably without the 
generation of new stereogenic centers); and the protecting group has a minimum of 
additional functionality to avoid further sites of reaction. As detailed herein, 
oxygen, sulfur, nitrogen and carbon protecting groups may be utilized. For example, 
15 in certain embodiments, as detailed herein, certain exemplary oxygen protecting 
groups are utilized. These oxygen protecting groups include, but are not limited to 
methyl ethers, substituted methyl ethers {e.g., MOM (methoxymethyl ether), MTM 
(methylthiomethyl ether), BOM (benzyloxymethyl ether), PMBM or MPM (p- 
methoxybenzyloxymethyl ether), to name a fe-w), substituted ethyl ethers, 
20 substituted benzyl ethers, silyl ethers (e.g., TMS (trimethylsilyl ether), TES 
(triethylsilylether), TIPS (triisopropylsilyl ether), TBDMS (t-butyldimethylsilyl 
ether), tribenzyl silyl ether, TBDPS (t-butyldiphenyl silyl ether), to name a few), 
esters (e.g., formate, acetate, benzoate (Bz), trifluoroacetate, dichloroacetate, to 
name a few), carbonates, cyclic acetals and ketals. In certain other exemplary 
25 embodiments, nitrogen protecting groups are utilized. These nitrogen protecting 
groups include, but are not limited to, carbamates (including methyl, ethyl and 
substituted ethyl carbamates (e.g., Troc), to name a few) amides, cyclic imide 
derivatives, N-Alkyl and N-Aryl amines, imirie derivatives, and enamine 
derivatives, to name a few. Certain other exemplary protecting groups are detailed 
30 herein, however, it will be appreciated that the present invention is not intended to 
be limited to these protecting groups; rather, a variety of additional equivalent 
protecting groups can be readily identified using the above criteria and utilized in 
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the present invention. Additionally, a variety of protecting groups are described in 
"Protective Groups in Organic Synthesis" Third Ed. Greene, T.W. and Wuts, P.G., 
Eds., John Wiley & Sons, New York: 1999, the entire contents of which are hereby 
incorporated by reference. 

[0039] The term "natural amino acid side chain" as used herein refers to the 

side chain of any one of the common, naturally occurring L-amino acids found in 
naturally occurring proteins: glycine (Gly), alanine (Ala), valine (Val), leucine 
(Leu), isoleucine (He), lysine (Lys), arginine (Arg), histidine (His), proline (Pro), 
serine (Ser), threonine (Thr), phenylalanine (Phe), tyrosine (Tyr), tryptophan (Trp), 
aspartic acid (Asp), glutamic acid (Glu), asparagine (Asn), glutamine (Gin), cysteine 
(Cys) and methionine (Met). 

[0040] The term "unnatural amino acid side chain" as used herein refers to 

the side chain of all amino acids which are not natural amino acids. This includes, 
for example, a-, |K D-, L- amino acid residues, and compounds of the general 


formula " o wherein the side chain R is other than the amino acid side 
chains occurring in nature. 

[0041] More generally, the term "amino acid side chain", as used herein, 

encompasses natural amino acid and unnatural amino acid side chains. 
[0042] As used herein, the term "pharmaceutically useful group or entity" 

refers to a compound or fragment thereof, or an organic moiety which, when 
covalently attached to a peptide or protein, can exert some biological or diagnostic 
function or activity when administered to a subject, or enhance the therapeutic, 
diagnostic or preventive properties of the parent peptide and/or protein in 
biomedical applications, or improve safety, alter biodegradation or excretion, or is 
detectable. Examples of suitable pharmaceutically useful groups or entities include 
hydrophilicity/hydrophobicity modifiers, pharmacokinetic modifiers, biologically 
active modifiers, detectable modifiers. A modifier can have one or more 
pharmaceutical functions, e.g., biological activity and/or pharmacokinetics 
modification. Pharmacokinetics modifiers can include, for example, antibodies, 
antigens, receptor ligands, hydrophilic, hydrophobic or charged groups. Biologically 
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active modifiers include, for example, therapeutic drags and prodrugs, antigens, 
immunomodulators. Detectable modifiers include diagnostic labels, such as 
radioactive, fluorescent, paramagnetic, superparamagnetic, ferromagnetic, X-ray 
modulating, X-ray-opaque, ultrosound-reflective, and otner substances detectable by 
one of available clinical or laboratory methods, e.g., scintigraphy, NMR 
spectroscopy, MRI, X-ray tomography, sonotomography, photoimaging, 
radioimmunoassay. Modifiers can be small molecules or macromolecules, and can 
belong to any chemical or pharmaceutical class, e.g., nucleotides, chemotherapeutic 
agents, antibacterial agents, antiviral agents, immunomodulators, hormones or 
analogs thereof, enzymes, inhibitors, alkaloids and therapeutic radionuclides. Viral 
and non-viral gene vectors are considered to be a phanrxaceutically useful entity or 
group. 

[0043] The term "biomolecules", as used herein, refers to molecules (e.g., 

proteins, amino acids, peptides, polynucleotides, nucleotides, carbohydrates, sugars, 
lipids, nucleoproteins, glycoproteins, lipoproteins, steroids, etc.) which belong to 
classes of chemical compounds, whether naturally-occurring or artificially created 
(e.g., by synthetic or recombinant methods), that are commonly found in cells and 
tissues. Specific classes of biomolecules include, but are not limited to, enzymes, 
receptors, neurotransmitters, hormones, cytokines, cell response modifiers such as 
> growth factors and chemotactic factors, antibodies, vaccines, haptens, toxins, 
interferons, ribozymes, anti-sense agents, plasmids, DNA, and RNA. 
[0044] As used herein, the term "small molecule" refers to molecules, 

whether naturally-occurring or artificially created (e.g., via chemical synthesis) that 
have a relatively low molecular weight. Preferred small molecules are biologically 
5 active in that they produce a local or systemic effect in animals, preferably 
mammals, more preferably humans. Typically, small irxolecules have a molecular 
weight of less than about 1500 g/mol. In certain preferred embodiments, the small 
molecule is a drug. Preferably, though not necessarily, the drug is one that has 
already been deemed safe and effective for use by the appropriate governmental 
50 agency or body. For example, drugs for human use listed by the FDA under 21 
C.F.R. §§ 330.5, 331 through 361, and 440 through 460; drugs for veterinary use 
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listed by the FDA under 21 C.F.R. §§ 500 through 589, incorporated herein by 
reference, are all considered suitable for use with the present hydrophilic polymers. 
[0045] Classes of small molecule drugs that can be used in the practice of the 

present invention include, but are not limited to, vitamins, anti-AIDS substances, 
anti-cancer substances, antibiotics, immunosuppressants, anti-viral substances, 
enzyme inhibitors, neurotoxins, opioids, hypnotics, anti-histamines, lubricants, 
tranquilizers, anticonvulsants, muscle relaxants and anti-Parkinson substances, 
anti-spasmodics and muscle contractants including channel blockers, miotics and 
anti-cholinergics, anti-glaucoma compounds, anti-parasite and/or anti-protozoal 
compounds, modulators of cell-extracellular matrix interactions including cell 
growth inhibitors and anti-adhesion molecules, vasodilating agents, inhibitors of 
DNA, RNA or protein synthesis, anti-hypertensives, analgesics, anti-pyretics, 
steroidal and non-steroidal anti-inflammatory agents, anti-angiogenic factors, anti- 
secretory factors, anticoagulants and/or antithrombotic agents, local anesthetics, 
ophthalmics, prostaglandins, anti-depressants, anti-psychotic substances, 
anti-emetics, imaging agents . Many large molecules are also drugs. 
[0046] A more complete, although not exhaustive, listing of classes and 

specific drugs suitable for practicing the present invention may be found in 
"Pharmaceutical Substances: Syntheses, Patents, Applications" by Axel Kleemaim 
, and Jurgen Engel, Tbieme Medical Publishing, 1999 and the "Merck Index: An 
Encyclopedia of Chemicals, Drugs, and Biologicals", Edited by Susan Budavari et 
al, CRC Press, 1996, both of which are incorporated herein by reference. 
[0047] As used herein, the term "macromolecules" refers to molecules, 

whether naturally-occurring or artificially created {e.g., via chemical synthesis) that 
5 have a relatively high molecular weight, e.g., generally above 1500 g/mole. 
Preferred macromolecules are biologically active in that they exert a biological 
function in animals, preferably mammals, more preferably humans. Examples of 
macromolecules include proteins, enzymes, growth factors, cytokines, peptides, 
polypeptides, polylysine, proteins, lipids, polyelectrolytes, immunoglobulins, DNA, 
l0 RNA, ribozymes, plasmids, and lectins. For the purpose of this invention, 
supramolecular constructs such as viruses and protein associates {e.g., dimers) are 
considered to be macromolecules. When covalently attached to a peptide or protein, 
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a macromolecule may be chemically modified prior to being covalently attached to 
said peptide or protein. 

[0048] As used herein, the term "diagnostic label" refers to an atom, group 

of atoms, moiety or functional group, a nanocrystal, or other discrete element of a 
5 composition of matter, that can be detected in vivo or ex vivo using analytical 
methods known in the art. When covalently attached to a peptide or protein, such 
diagnostic labels permit the monitoring of the peptide or protein in vivo. On the 
other hand, constructs and compositions that include diagnostic labels can be used to 
monitor biological functions or structures. Examples of diagnostic labels include, 
l0 without limitations, labels that can be used in medical diagnostic procedures, such 
as, radiopharmaceutical or radioactive isotopes for gamma scintigraphy and Positron 
Emission Tomography (PET), contrast agent for Magnetic Resonance Imaging 
(MRI) (for example paramagnetic atoms and superparamagnetic nanocrystals), 
contrast agent for computed tomography, contrast agent for X-ray imaging method, 
15 agent for ultrasound diagnostic method, agent for neutron activation, and moiety 
which can reflect, scatter or affect X-rays, ultrasounds, radiowaves and microwaves, 
fluorophores in various optical procedures, etc. 

Brief Description of the Drawing 
20 [0049] Figure 1 depicts examples of bifunctional glycopeptides {i.e., 

diglycosylated peptides) containing O-linkages preparaed by the method of the 
invention. 

[0050] Figure 2 depicts anomeric p-glycosylamines (22 and 23) that differ at 

a single stereocenter (asterisks), and a bifunctional glycopeptide (24) containing one 
25 of these structurally complex glycans. 

DETAILED DESCRIPTION OF CERTAIN PREFERRED EMBODIMENTS OF THE 

Invention 

[0051] In one aspect, the present invention provides novel methodologies for 

30 the synthesis of peptides and/or proteins functionalized at two or more non-adjacent 
amino acid sites {e.g., bearing a pharmaceutical^ useful group or entity covalently 
attached to at least two non-adjacent amino acid residues of the polypeptide/protein 
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chain). In certain embodiments, the invention provides a system for the synthesis of 
polypeptides and proteins bearing at least two glycans co-valently attached to non- 
adjacent amino acid residues of the polypeptide/protein chain. In certain 
embodiments, in the context of synthetic studies developed for the total synthesis of 
glycosylated fragments of erythropoietin, generalized methodologies were 
developed for the improved synthesis of polyglycosylated peptides and proteins. 
This general synthetic method encompasses the realization that native chemical 
ligation (NCL) is a glycan-compatible process that may be used for assembling large 
multiply glycosylated polypeptides from glycopeptide building blocks. In yet 
another aspect, the present invention also provides the recognition that 
chemoselective reaction of a glycopeptide containing a C-terminal cysteine with a 
glycopeptide thioester may be achieved to from the corresponding diglycosylated 
peptide adduct. 

[0052] Specific examples, particularly with respect to the synthesis of 

bifunctional glycopeptides (/.*., diglycosylated peptides), are described in more 
detail below and in the Exemplification herein, along with certain general 
methodologies developed during the course of these syntheses. It will be 
appreciated by one of ordinary skill in the art that these examples are not intended to 
be limiting; rather all equivalents are intended to be incorporated into the scope of 

) the present invention. In particular, the inventive method may be generally adapted 
to the preparation of polyfunctionalized peptides and proteins. 
[0053] 1) Description of certain embodiments of the inventive method 

[0054] In one aspect of the present invention, there is provided a method for 

preparing a polyfunctionalized peptide comprising a peptidic backbone made up of 

.5 four or more amino acids wherein two or more non-adjacent amino acids are 
independently substituted with a moiety having the structure: 


wherein each occurrence of L 1 is independently a substituted or 
unsubstituted, linear or branched, cyclic or acyclic, saturated or unsaturated aliphatic 
or heteroaliphatic moiety; 
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each occurrence of A is independently an aliphatic, heteroaliphatic, 
aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutically useful group or 
entity; 

with the proviso that the peptide sequence between any two consecutive, 
5 non-adjacent, amino acids bearing a A-L 1 - moiety comprises at least one cysteine 
residue. 

In one aspect of the present invention, there is provided a method for 
preparing a polyfunctionalized peptide comprising a peptidic backbone made up of 
four or more amino acids wherein two or more non-adjacent amino acids are 
10 independently substituted with a moiety having the structure: 

with the proviso that the peptide sequence between any two consecutive, 
non-adjacent, amino acids bearing a A-L 1 - moiety comprises at least one cysteine 
residue; 

1 5 wherein the method comprises a step of: 

reacting a peptide acyl donor comprising a peptidic backbone made up of 
two or more amino acids wherein said peptide acyl donor has the structure: 

/, Peptide Backbone k O R X0 
r*W 1 Y 

O ; 

with a peptide amine acceptor having the structure: 



H 2 N 

6 

under suitable conditions to effect ligation; 

wherein kl and k2 are independently integers between 1 and about 20; 
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each occurrence of A, Ai and A 2 is independently an aliphatic, 
heteroaliphatic, aromatic, heteroaromatic, aryl, heteroaryl or a pharmaceutical^ 
useful group or entity; 

R S1 is a sulfide protecting group; 

R xo is a group such that the moiety -C(=O)OR x0 can be made to undergo 
ligation with the peptide amine acceptor; 

each occurrence of L 1 is independently a substituted or unsubstituted, linear 
or branched, cyclic or acyclic, saturated or unsaturated aliphatic or heteroaliphatic 
moiety; 

R X1 is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid; and 

R 50 is -OR* 2 * or -NR^R* 20 , wherein R* 23 is hydrogen, alkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a^arboxylk acid 
protecting group, an amino acid or a proctected amino acid; and R 30 " and R* 20 are 
independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 
alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 
proctected amino acid. 

[0055] In certain embodiments, each occurrence of A is independently a 

) pharmaceutical^ useful group or entity. In certain embodiments, each occurrence of 
A is independently a biomolecule, a small molecule, a macromolecule or a 
diagnostic label. 

[0056] In certain exemplary embodiments, each occurrence of A is 

independently a carbohydrate determinant having the structure: 


wherein a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
the proviso that the x, y and z bracketed structures represent furanose or pyranose 
moieties and the sum of b and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 
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g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
wherein Ro is hydrogen, a linear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R 2 , Rs, R4, Rs, Rs , R7, R* and R 9 is 
independently hydrogen, OH, OR', NHR J , NHCOR', F, CH 2 OH, CH 2 OR', a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R' is independently hydrogen, CHO, COOR u , or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 
saccharide moiety having the structure: 
R12 



wherein Y and Z are independently NH or O; wherein k, 1, r, s, t, u, v and w 
are each independently 0, 1 or 2; with the proviso that the v and w bracketed 
structures represent furanose or pyranose moieties and the sum of 1 and k is 1 or 2, 
and the sum of s and u is 1 or 2, and with the proviso that v and w are not 
simultaneously 0; wherein R'o is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rio, R11, R12, R13, Ru and R15 is 
independently hydrogen, OH, OR iU , NHR in , NHCOR 1 ", F, CH 2 OH, CH 2 OR iH , or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R lfi is hydrogen, COOH, COOR", CONHR", a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occurrence 
of R Hi is hydrogen, CHO, COOR iv , or a substituted or unsubstituted linear or 
branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each occurrence of 
R n and R iv are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group. 

[0057] In certain embodiments, L 1 may comprise any functional moiety that 

is compatible with native chemical ligation reaction conditions. In certain 
embodiments, L 1 may comprise any functional moiety that is compatible with 
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aqueous conditions. In certain embodiments, a compatible functionality is one that 
is stable, unreactive and/or minimally interferes with the reaction. A thiol group is 
considered a compatible functionality, even though a thiol group may slow down the 
reaction. Examples of suitable functionalities include, but are not limited to, 
hydrocarbons, amines, amides, imines, hydroxyls, ethers, carboxylic esters, 
aldehydes, thiols, olefins, alkynes, aryls and heteroaryls. In certain exemplary 
embodiments, L 1 does not comprise a thiol group. 

[0058] In certain embodiments, each occurrence of L 1 is independently -O- 

(CH 2 ) n -, wherein n is 0-9, or a glycoside-containing moiety (e.g mono- or poly- 
saccharide). 

[0059] In certain embodiments, each occurrence of L 1 is -0-(CH 2 ) n -CH 2 - 

and two or more non-adjacent amino acids is/are independently substituted with a 
moiety having the structure: 

wherein each occurrence of n is independently 0-8; and A is as defined 
above. In certain other embodiments, A is a carbohydrate moiety and each 
occurrence of the n-alkyl glycosidic moiety is independently either a- or p-linked to 
an amino acid residue of the backbone. It will be appreciated that 
polyfunctionalized peptides made according to the method of the present invention 
are not limited to those where each occurrence of L 1 comprises n-alkyl where n is 
greater than or equal to 1; rather each occurrence of A can be independently linked 
via the traditional direct linkage (n=0), via n-alkyl (such as pentyl), via a mono- 
saccharide moiety or any combination thereof. In preferred embodiments, each 
occurrence of A is independently selected from the group consisting of Globo-H, 
; fucosyl GM1, KH-1, glycophorin, STN, (2,3)ST, Le y , Le x , N3, Tn, 2,6-STn, Gb3 
andTF. 

[0060] In certain embodiments, occurrences of A may be the same or 

different. 

[0061] In certain other embodiments, certain occunences of A are clustered. 

D For example, in certain embodiments, the polyfunctionalized peptide comprises at 
least two adjacent functionalized amino acids (i.e., clustered functionalized amino 
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acids), separated from another functionalized site by a peptide amino acid sequence 
comprising at least one cysteine residue. 

[0062] In certain other embodiments, certain occurrences of A are clustered 

glycosides. For example, in certain embodiments, the multiglycosylated peptide 
comprises at least two adjacent glycosylated amino acids (i.e., clustered glycosylated 
amino acids), separated from another glycosylated site by a peptide amino acid 
sequence comprising at least one cysteine residue. 

[0063] In certain embodiments, the present invention provides a method for 

preparing a polyfunctionalized peptide having the structure: 


R X1 



R P2 


R P3 


sl and s2 are independently an integer from 1 to about 20; 
tl, t2 and t3 are each independently an integer; 

R xl is hydrogen, alkyl, acyl, aromatic, heteroaromatic, aryl, heteroaryl, - 

alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 

proctected amino acid; 

R X2 ig _ OR x2a or _ NR x2b R X2c^ wherein R x2a is ny drogen s alkyl, aromatic, 

heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), a carboxylic acid 

protecting group, an amino acid or a proctected amino acid; and R 3Qb and R* 20 are 

independently hydrogen, alkyl, aromatic, heteroaromatic, aryl, heteroaryl, - 

alkyl(aryl), -alkyl(heteroaryl), a nitrogen protecting group, an amino acid or a 

proctected amino acid; 

R pl , R P2 and R P3 are independently H, alkyl, heteroalkyl, aromatic, 
heteroaromatic, aryl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain; 

each occurrence of L 1 is independently a substituted or unsubstituted 
aliphatic or heteroaliphatic moiety; 
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Ai and A 2 are each independently an aliphatic, heteroaliphatic, aromatic, 
heteroaromatic, aryl, heteroaryl or a pharmaceutically useful group or entity; and 

at least one occurrence of the bracketed structure t2 is a cysteine residue or 
protected cysteine residue; 
5 wherein the method comprises a step of: 

reacting a peptide acyl donor having the structure: 


"OR XO 


with a peptide amine acceptor having the structure: 



10 under suitable conditions to effect ligation; 

wherein the sum t+t' equals (t2)+l; R S1 is a sulfide protecting group; and R xo 
is a group such that the moiety -C(=O)OR x0 can be made to undergo ligation with 
the glycopeptide amine acceptor. 

In certain exemplary embodiments, Ai and A 2 are each independently a 
15 carbohydrate domain having the structure: 


L Jy R-i R 3 


wherein a, b, c, d, e, f, g, h, i, x, y and z are independently 0, 1, 2 or 3, with 
; proviso that the x, y and z bracketed structures represent furanose or pyranose 
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moieties and the sum of b and c is 1 or 2, the sum of d and f is 1 or 2, and the sum of 
g and i is 1 or 2, and with the proviso that x, y and z are not simultaneously 0; 
wherein R 0 is hydrogen, a linear or branched chain alkyl, acyl, arylalkyl or aryl 
group; wherein each occurrence of Ri, R 2 , R 3 , R* Rs, R* , R* Rs and R 9 is 
independently hydrogen, OH, OR', NHR\ NHCOR', F, CH 2 OH, CH 2 OR', a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
tri)hydroxyalkyl 3 (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R l is independently hydrogen, CHO, COOR s \ or a substituted or 
unsubstituted linear or branched chain alkyl, acyl, arylalkyl or aryl group or a 
saccharide moiety having the structure: 


R12 



wherein Y and Z are independently NH or O; wherein k, 1, r, s, t, u, v and w 
are each independently 0, 1 or 2; with the proviso that the v and w "bracketed 
structures represent furanose or pyranose moieties and the sum of 1 and k is 1 or 2, 
15 and the sum of s and u is 1 or 2, and with the proviso that v and w are not 
simultaneously 0; wherein R'o is hydrogen, a linear or branched chain alkyl, acyl, 
arylalkyl or aryl group; wherein each occurrence of Rio, Rn, Ri& Rn, Rm and Ris is 
independently hydrogen, OH, OR®, NHR* NHCOR'", F, CH 2 OH, CH 2 OR'», or a 
substituted or unsubstituted linear or branched chain alkyl, (mono-, di- or 
20 tri)hydroxyalkyl, (mono-, di- or tri)acyloxyalkyl, arylalkyl or aryl group; wherein 
each occurrence of R 16 is hydrogen, COOH, COOR", CONHR«, a substituted or 
unsubstituted linear or branched chain alkyl or aryl group; wherein each occurrence 
of R iH is hydrogen, CHO, COOR iv , or a substituted or unsubstituted linear or 
branched chain alkyl, acyl, arylalkyl or aryl group; and wherein each occxirrence of 
25 R" and R iv are each independently H, or a substituted or unsubstituted linear or 
branched chain alkyl, arylalkyl or aryl group; and wherein each glycoside moiety is 
either a- or (Winked to an amino acid. 
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0 



l0 064] In certain embodiments, the step may be carried out once, or repeated 

a desired number of times, to prepare a polyfunctioned peptide having the 
structure: 

U 


R X2 


5 wherein R xl and R* 2 are as defined above; 

each occurrence of A may be the same or different and may be as defined for 
Ai and A 2 above; 

each occurrence of R pl may be the same or different and may be as defined 
for R P1 and R P2 above; 
10 q is an integer greater than or equal to 2; 

each occurrence of s is independently an integer from 1 to about 20; 
each occurrence oft is independently an integer; 
t0 is an integer; and 

each occurrence of R P0 is independently H, alkyl, heteroalkyl, aromatic, 
15 heteroaromatic, axyl, heteroaryl, -alkyl(aryl), -alkyl(heteroaryl), or a natural or non- 
natural amino acid side chain. 

[00651 In certain embodiments, q is an integer between 2 and about 5. In 

certain embodiments, q is an integer between 2 and about 10. In certain 
embodiments, q is an integer between 2 and about 15. In certain embodiments, q is 
20 an integer between 2 and about 20. In certain embodiments, q is an integer between 
2 and about 25. In certain embodiments, q is an integer between 2 and about 30. In 
certain embodiments, q is an integer greater than 30. In certain embodiments, q is 2. 
[00661 In certain embodiments, the sum s+t is between about 2 and about 6. 

In certain embodiments, the sum s+t is between about 2 and about 10. In certain 
25 embodiments, the sum s+t is between about 2 and about 15. In certain 
embodiments, the sum s+t is between about 2 and about 20. In 
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embodiments, the sum s+t is between about 2 and about 50. In certain 
embodiments, the sum s+t is between about 2 and about 100. In certain 
embodiments, the sum s+t is between about 2 and about 150. In certain 
embodiments, the sum s+t is between about 2 and about 200. In certain 
5 embodiments, the sum s+t may be greater than 200. 

[0067] In certain embodiments, tO is an integer from 0 to about 2. In certain 

embodiments, tO is an integer from 0 to about 5. In certain embodiments, tO is an 
integer from 0 to about 10. In certain embodiments, tO is an integer from 0 to about 
15. In certain embodiments, tO is an integer from 0 to about 20. In certain 
10 embodiments, tO is an integer from 0 to about 25. In certain embodiments, tO is an 
integer from 0 to about 30. In certain embodiments, tO is an integer from 0 to about 
50. In certain embodiments, tO is an integer from 0 to about 100. In certain 
embodiments, tO is an integer from 0 to about 150. In certain embodiments, tO is an 
integer from 0 to about 200. In certain embodiments, tO is an integer greater than 
15 200. 

[0068] In certain other embodiments, R X1 is hydrogen, Fmoc or Ac. 

[0069] In certain other embodiments, is NH 2 . 

[0070] In certain other embodiments, R xo is a sulfur-substituted aryl moiety. 

In certain embodiments, R xo is a disulfide-substituted aryl moiety. In certain 
20 embodiments, R x0 has the structure: 



4-SSR 


wherein R is an aliphatic, heteroaliphatic, aromatic or heteroaromatic moiety. 
In certain exemplary embodiments, R xo has the structure: 



RSS 

[0071] In certain embodiments, R is lower alkyl. In certain exemplary 

embodiments, R is ethyl. 

[0072] In certain other embodiments, R sl is -S/Bu. 
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[0073] In certain exemplary embodiments, in the step of reacting the peptide 

acyl donor having the structure: 

0 


"OR X0 


R p1 


R p2 


with the peptide amine acceptor under suitable conditions to effect ligation, 
a intermediate having the following structure is formed in situ: 


R P1 


R p2 


- SR X0a 


wherein R XOa is an oxygen-substituted aryl moiety. 
[0074] In certain embodiments, the suitable conditions to effect ligation 

comprise MES-Na. 

10 [0075] In certain exemplary embodiments, in the peptide acyl donor having 

the structure: 

0 


R X1 


the amino acyl residue directly attached to -OR X0 is phenylalanine. 
[0076] In certain embodiments, A may comprise any functional moiety that 

15 is compatible with native chemical ligation reaction conditions. In certain 
embodiments, A may comprise any functional moiety that is compatible with 
aqueous conditions. In certain embodiments, a compatible functionality is one that 
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is stable, unreactive and/or minimally interferes with the reaction. A thiol group is 
considered a compatible functionality, even though a thiol group may slow down the 
reaction. Examples of suitable functionalities include, but are not limited to, 
hydrocarbons, amines, amides, imines, hydroxyls, ethers, carboxylic esters, 
aldehydes, thiols, olefins, alkynes, aryls and heteroaryls. In certain exemplary 
embodiments, A does not comprise a thiol group. 

[0077] In certain embodiments, when at least one occurrence of A (or Ai 

and/or A 2 , as further defined for A) is a carbohydrate domain, some or all of 
carbohydrate domains are O-linked to the peptide backbone. In certain other 
embodiments, when at least one occurrence of A (or A, and/or A 2 , as further defined 
for A) is a carbohydrate domain, some or all of carbohydrate domains are N-linked 
to the peptide backbone. In certain other embodiments, when at least one 
occurrence of A (or A, and/or A 2 , as further defined for A) is a carbohydrate 
domain, the inventive method may be praticed while the carbohydrate domain is 
partially or fully deprotected (i.e., comprises exposed OH groups). In certain 
embodiments, the peptide sequence between each point of attachment of the A 
moieties comprises a cysteine residue. In certain embodiments, the 
polyfunctionalized construct {i.e., construct bearing more than one A moiety) is 
prepared by Native Chemical Ligation. In certain embodiments, the 
i polyfunctionalized peptides obtained by the inventive method are symmetrical 
(functionalized peptide building blocks to be ligated have the same peptide sequence 
(safe for N- and/or C-terminals) and bear the same functionalization pattern), 
nonsymmetrical (functionalized peptide building blocks to be ligated differ in 
peptide sequence (excluding N- and C-terminals) and/or functionalization pattern). 
5 In certain embodiments, the inventive method allows the preparation of 
multiglycosylated peptides designed to approximate the spatial positions) of 
carbohydrate(s) in glycoprotein/glycopeptides of interest (e.g., naturally occurring 
glycoproteins such as gpl20 and erythropoietin). 

[0078] In certain embodiments, the method further comprises a step of 

>0 conjugating the polyfunctionalized peptide to an immunogenic carrier. In certain 
exemplary embodiments, the carrier is a protein, a peptide or a lipid. In certain other 
exemplary embodiments, the carrier is Bovine Serum Albumin (BSA), Keyhole 
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Limpet Hemocyanin (KLH) or polylysine. In certain other embodiments, the carrier 
is a lipid carrier having the structure: 



wherein m, n and p are each independently integers between about 8 and 20; 
and R v is hydrogen, substituted or unsubstituted linear or branched chain lower alkyl 
or substituted or unsubstituted phenyl. In certain exemplary embodiments, m\ n' 
and p' are each 14 and the lipid is tripalrnitoyl-S-glycerylcysteinylserine {e.g., 
PamCys). 

[0079] It will be appreciated that the carrier can be linked to the 

polyfunctionalized peptide either directly or through a crosslinker, and thus the 
peptide may be attached to a protein, peptids, or lipid, as well as a (crosslinker- 
protein), (crosslinker-peptide) and (crosslinker-lipid) moiety. 

[0080] Crosslinkers suited to the invention are widely known in the art (see, 

for example, 1994 Pierce Technical Handbook: cross-linking (Appendix A), which 
is available at http://www.piercenet.com/ resources/browse.cfm?fldII>=184), 
including bromoacetic NHS ester, 6-(iodoacetamido)caproic acid NHS ester, 
maleimidoacetic acid NHS ester, maleimidobenzoic acide NHS ester, etc. In certain 
preferred embodiments, the crosslinker is MMCCH (4-(maleimidomethyl) 
cyclohexane-l-carboxyl hydrazide). In certain other preferred embodiments, the 
crosslinker is MBS (m-maleimidobenzoyl acid N-Hydroxysuccinimidyl ester). In 
certain embodiments, the crosslinker is a fragment having the structure: 
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whereby said structure is generated upon conjugation of a maleimidobenzoic 
acid N-hydroxy succinimide ester with a suitable functionality on the 
polyfunctionalized peptide. 

[0081] 2) Exemplary Synthetic Methodology 

[0082] Native Chemical Ligation 

[0083] One of the more widely used methods for the synthesis of 

glycopeptides is native chemical ligation (NCL). 48 First reported by Kent in 1994, 
NCL allows for the assembly of large proteins with native amide bonds from 
unprotected peptide building blocks (Scheme 1). Furthermore, the reaction is mild, 
selective, and compatible with the presence of glycans. When glycans are present in 
the reaction, they are typically found on the ' C-terminal side. In the event, a 
glycopeptide containing a C-terminal cysteine undergoes a chemoselective reaction 
with a peptide thioester. The resulting peptide thioester then rearranges 
spontaneously to furnish a native peptide bond, effectively lengthening the peptide 
backbone of the glycopeptide. 

[0084] Scheme 1. Exemplary Native Chemical Ligation methodology 

0 o <Psa® chemoselective HzN^J ^g^g^ 

spontaneous 


[0085] As discussed above, glycopeptides bearing multiple glycosidic 

domains are of particular interest. Such structures include the synthesis of g P 120 
fragments, erythropoietin (EPO), human chorionic gonadotropin (HCO), and 
amyloid precursor protein (APP). Drawing from previous studies in the area, 49 NCL 
was explored as a way to couple two small glycopeptides to form one large 
glycopeptide. One drawback that became apparent lies in the difficulty with which 
unprotected glycopeptide a-thioesters are synthesized. 50 Accordingly, a new 
method for the synthesis of glycopeptide a-thioesters, or their equivalent, was 
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devised, to allow the assembly of glycopeptides bearing multiple glycosidic 
domains. 

[0086] A closer look at the reaction mechanism of NCL provided clues as to 

how mis problem might be approached. In one embodiment of the reaction itself, 
me sulfhydryl group of the cysteine coupling partner, initially protected as a t- 
butyldisulfide, is released by disulfide exchange with 2-mercaptoethanesulfonic acid 
(MESNa) and then undergoes trans-thioesterification as mentioned above. Without 
wishing to be bound to any particular theory, it was thought that it might be possible 
to mask the a-thioester as a phenyl ester, bearing an ethyldisulfide in the ortho 
position (Scheme 2) by taking advantage of this disulfide exchange. Under the 
reaction conditions, the free sulfhydryl would be released and undergo a 
spontaneous rearrangement to furnish a thioester capable of entering into the NCL 
pathway. 

[0087] Scheme 2. A new method for the generation of peptide a-thioesters. 


NCL: 


Modification: 


^ S i SEt 'BuSS^ " s 
"activated" I 

disulfide 

[0088] To test the viability of this proposed modification, the simple 

dipeptide, PheCys, was synthesized. The desired orrto-substituted phenol was 
synthesized in two steps from commercially available 2-mercaptophenol by 
oxidation and disulfide exchange. Coupling of the phenol to phenylalanine followed 
by the removal of the Boc protecting group set the stage for our first attempt 
(Scheme 3). The reaction proceeded in good yield. More importantly no 
racemization was observed under the reaction conditions. 
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[0089] Scheme* Exemplary NCL with masked thioester 

1) Boc-L-Phe-OH, 

^.OH 1) > 2 ,MeOH,H 2 Q f^°" EDCI, DMAP 

l^A SH 2) BF 3 *OEt,(EtS)r ^^SSEt 


2) TFA, Et 3 SiH 
H z O 


O 


h 2 n^y B y^ oh 

SSEt pH7.5 0 ^SH 



78% yield 
No observed 
racemization 


[00901 The synthesis of glycopeptides bearing two glycosidic domains was 

then undertaken. Synthesis of the tf-terminal fragment (the fragment that would 
ultimately become the masked thioester) began with the solid phase synthesis of a 
peptide acid, followed by standard peptide coupling to introduce the phenylalanine 
bearing the phenyl ester. TFA deprotection of the side chain protecting groups 

furnished the peptide backbone for the ^-terminal fragment (Scheme 4). 

[0091] Scheme 4. Exemplary synthesis of the peptide backbone of the N~ 

terminal fragment. 

I SEt 36%fromSPPS NH NH 
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[0092] Coupling of an oligosaccharide with excess peptide furnished a fully 

deprotected glycopeptide ready for ligation (Scheme 5). 

[0093] Scheme 5. Exemplary approach to the completion of the iV-terminal 

fragment 


OH 


|pB90MCchjr^ -NH 2 


'C " °( 

NH NH 
HN^NHj HN^NH 



[0094] With the JV-terminal portion in hand, the synthesis of the C-terminal 

coupling partner was then carried out. In a similar way, the peptide backbone was 
10 prepared by solid phase peptide synthesis. Upon cleavage from the resin, 
glycosylation and Fmoc removal, the C-terminal glycopeptide was obtained 
(Scheme 6). 

[0095] Scheme 6. Exemplary synthesis of the C-terminal glycopeptide 
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Ssteu ° Jy» ° S>H 

[0096] In certain exemplary embodiments, the synthesis of glycopeptides 

bearing two glycosydic domains was accomplished as follows: equal molar amounts 
of both the JV-terminal and C-terrninal glycopeptides were combined in a LCMS vial 
and to this was added a solution of MESNa in phosphate buffered saline. The 
reaction was monitored by LCMS and to our amazement, the rearrangement of the 
phenyl ester to thioester occurred quite rapidly, 51 additionally mass peaks 
corresponding to the desired product were observed almost instantly. Upon 
completion of the reaction, any remaining disulfide bonds were reduced by the 
action of tris(2-carboxyethyl) phosphine hydrochloride (TCEP). The crude reaction 
mixture was then subjected to HPLC for purification. The reaction proceeded 
extremely well, providing doubly glycosylated glycopeptides, typically in 60-70% 
yield. Exemplary di-glycosylated peptides that may be prepared by the method of 
the present invention are detailed in the Exemplification below. It should be noted 
that symmetrical, nonsymmetrical and mixed (^-linked and O-linked) glycopeptides 
can be used. 

[0097] Accordingly, there is provided herein a method for preparing 

glycopeptides comprising at least two carbohydate domains covalently attached 
thereto. In certain embodiments, some or all of carbohydrate domains are O-linked. 
In certain other embodiments, some or all of carbohydrate domains are N-linked. In 
certain embodiments, the glycopeptide comprises two or more carbohydate domains 
covalently attached thereto, wherein the glycopeptide sequence between each point 
of attachment of the carbohydrate domains comprises a cysteine residue. In certain 
embodiments, the mutli-glycan glycopeptide is prepared by Native Chemical 
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Ligation. In certain embodiments, the method allows for coupling where each 
coupling partner is a glycopeptide itself. Symmetrical, nonsymmetrical and mixed 
(N-linked and O-linked) glycopeptides can be obtained. In certain embodiments, the 
method involves the in situ generation of a thioester that is then used immediately in 
native chemical ligation. 
[0098] Peptide Thioester Synthesis 

[0099] Several methods have been developed for peptide thioester synthesis, 

including the original "Boc chemistry" (Boc = f^-butyloxycarbonyl) method (See, 
for example, (1) Canne, L. E.; Walker, S. M.; Kent, S. B. H. "A General Method for 
the Synthesis of Thioester Resin Linkers for Use in the Solid-Phase Synthesis of 
Peptide Alpha-Thioacids." Tetrahedron Lett. 1995, 36, 1217-1220; and (2) Hojo, H.; 
Aimoto, S. "Polypeptide Synthesis Using the S-Alkyl Thioester of a Partially 
Protected Peptide Segment-Synthesis of the DNA- Binding Domain of C-Myb 
Protein (142-193)-NH2." Bull. Chem. Soc. Jpn. 1991, 64, 111-117) and several 
Fmoc-compatible systems (See, for example, (1) Shin, Y.; Winans, K. A; Backes, 
B. J.; Kent, S. B. H.; Ellman, J. A.; Bertozzi, C. R. "Fmoc-based synthesis of 
peptiae-(al P ha)thioesters: Application to the total chemical synthesis of a 
glycoprotein by native chemical ligation." J. Am. Chem. Soc. 1999, 121, 11684- 
11689; (2) Ingenito, R.; Bianchi, E.; Fattori, D.; Pessi, A. "Solid phase synthesis of 
0 peptide C-terminal thioesters by Fmoc/t-Bu chemistry." J. Am. Chem. Soc. 1999, 
12U 11369-11374; (3) Li, X. Q.; Kawakami, T.; Aimoto, S. "Direct preparation of 
peptide thioesters using an Fmoc solidphase method." Tetrahedron Lett. 1998, 39, 
8669-8672; (4) Clippingdale, A. B.; Barrow, C. J.; Wade, J. D. "Peptide thioester 
preparation by Fmoc solid phase peptide synthesis for use in native chemical 
>5 ligation." J. Pept. Sci. 2000, 6, 225-234; and (5) Bu, X. Z, Xie, G. Y.; Law, C. W.; 
Guo Z. H. "An improved deblocking agent for direct Fmoc solidphase synthesis of 
peptide thioesters." Tetrahedron Lett. 2002, 43, 2419-2422). In ceratin 
embodiments, the model thioester is a C-tenninal glycine thioester, which is locally 
achiral and cannot be epimerized, and is therefore easy to synthesize. Though the 
30 desired GP120 thioester contains an epimerization-prone C-terminal histidine (His) 
residue, such thioesters have been synthesized previously and have in fact been 
shown to modulate favorably the rate of NCL (See, for example, Hackeng, T. M.; 


Page 39 of 74 


W 0 2005/044841 PCT/OS2004/029047 

Griffin J H.; Dawson, P. B. "Protein synthesis by native chemical ligation: 
Expanded scope by using straightforward methodology." tree. Natl. Acad. Set V. S. 
A 1999, 96, 10068-10073). 

[00100] It is understood that, although the discussion above regarding Native 
5 Chemical Ligation focuses on the preparation of multiglycosylated peptides, the 
method may readily be adapted to other polyfunctioned peptides and/or protems. 
For example, peptides and/or proteins functionalized at more than one ammo actd 
site with a pharmaceutical useful group or entity may be prepared by the method 
of the invention. 

10 

Equivalents 

[00101] The representative examples which follow are intended to help 
illustrate the invention, and are not intended to, nor should they be construed to, 
limit the scope of the invention Indeed, various modifications of the inventton and 
15 many farther embodiments thereof, in addition to those shown and described herem, 
will become apparent to those skilled in the art from the full contents of fins 
document, including the examples which follow and the references to the scienrific 
and patent literature cited herein. In but one iUustrative example, protecting groups 
play an important role in me synthesis of the carbohydrate domains and synthefio 
20 conjugates, as described herein; however i, will be appreciated by one of ordmary 
skill in the art that the present invention encompasses the use of various alternate 
protecting groups known in the art. Those protecting groups used in the disclosure 
including the Examples below are merely illustrative. 

[00102] It should further be appreciated that, uless otherwise indicated, the 
25 contents of those cited references are incorporated herein by reference to help 
illustrate the state of the art. The following examples contain important addmonal 
information, exemplification and guidance which can be adapted to the practice of 
tins invention in its various embodiments and the equivalents thereof. 

30 Exemplification 

[00103] The method of this invention can be understood further by the 
examples that illustrate some of the processes by which the inventive method may 
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be practice. It will be appreciated, however, that these examples do not limit the 
invention. Variations of tte invention, now known or tether developed, are 
considered to fall within the scope of the present invention as described herem and 

as hereinafter claimed. 
5 [00104] X) General Description of Synthetic Methods: 

[001051 The practitioner has a a well-established literature of peptide, protein 
and glycoside chemistry to draw npon, in combination with the informarion 
contained herein, for guidance on synthetic strategies, protecting groups, and other 
materials and methods useful for practicing the method of this invention. 
,0 [00106] The various references cited herein provide helpful background 
information on preparing complex glycosides, glycosylated peptides and other 
glycosylated constructs that may be applied and/or adapted to the method of the 
present invention. 

[001071 According to the present invention, any available techniques can be 
15 naed to practice the method of the invention. For example, a variety of solutton 
phase synthetic methods such as those discussed in detail below may be used. 
Alternatively or additionally, a variety combinatorial techniques, parallel synthesrs 
and/or solid phase synthetic methods known in the art may be used. 
[01001 Starting materials and reagents used in practicing the specific 

20 embodiments of the invention derailed below and herein are either available from 
commercial suppliers such as Aldrich Chemical Company (Milwaukee, WI> 
Bachem (Torrance, CA), Sigma (St. Louis, MO), or are prepared by methods well 
known to a person of ordinary skill in the art following procedures described in such 
references as Fieser and Fieser 1991, "Reagents for Organic Synthesis", vols 1-17, 
25 John Wiley and Sons, New York, NY, 1991; Rodd 1989 "Chemistry of Carbon 
Compounds", vols. 1-5 and supps, Elsevier Science Publishers, 1989; "Organic 
Reactions", vols 1-40, John Wiley and Sons, New York, NY, 1991; March 2001, 
"Advanced Organic Chemistry", 5m ed. John Wiley and Sons, New York, NY; and 
Larock 1990, "Comprehensive Organic Transformations: A Guide to Functronal 
30 Group Preparations", 2" a ed. VCH Publishers. These schemes are merely Mustrabve 
of some embodiments of the present invention, and various modifications to these 
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schemes can be made and will be suggested to a petson of ordinal skill in the art 
having regard to this disclosure. 

[01011 In practicing the invention, starting materials, intermediates, and 

compounds may be isolated and purified using conventional techniques, inducting 
5 filtration, distillation, crystallization, chromatography, and the like. They may be 
characterized using conventional methods, including physical constants and spectral 
data. 

[0102] Methods for preparing glycopeptides (eg., O- or N-linked 

giycopeptides) and for conjugating peptides and glycopeptides to carriers are known 
,„ intheart. For example, guidance may be found in U.S. Patent No.: 6,660,714; U.S. 
Paten. Application Nos, 09/641,742; 10/209,618; and 10/728,041; U.S. Provistonal 
Patent Application Nos,. 60/500,161 and 60/500,708; and International Patent 
Application Nos, PCX/ US03/38453 and PCT/ US03/38471; each of the above- 
referenced patent documents are hereby incorporated by reference herem. 

15 [0103] Overview 

[0104] Following extensive earlier studies, wherein our chemistry was 

interfaced with important findings from outer laboratories," we demonstrated the 
ability to ligate fully synthetic Winked glycopeptides containing various complex 
otigosaecharides to larger polypeptide domains." For example, major glycosylated 
2 „ subsections of prostate specific antigen (PSA)- and gp!20» could be obtained. 
Provided herein is an exemplary convergent approach to access polypephdes 
multiply glycosylated at designated sites." As discussed previously, glycoprotems 
have potentially important clinical roles in the contexts of vaccines, diagnost.es, 
and therapeutics* The method of the present invention thus has tmportant 
25 applications in the field of medicine since it offers access to complex therapeutically 
useful glycoproteins. 

[0105] Scheme 7 schematizes an embodiment of the present invention. For 

example, glycopeptides 4 and 7 may be generated in a convergent way fiom then 
respective glycan and peptide precursors, and may be coupled with full convergence 
30 to reach 8. 

[0106] Scheme 7. A convergent synthesis of Afunctional glycopeptides. 
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fGlycan = complex oligosaccharide j 
[0107] In one embodiment, the synthesis of a suitable glycopeptide acyl 

donor for ligation with 7 was investigated. One approach was based on utilizing 
building block 4, in the form of a thioester, as acyl donor. However, the preparation 
of a thioester such as 4 by convergent means 16 appeared to present difficulties. 
10108] It was in pondering this problem that an attractive possibility 

presented itself (Scheme 8). The synthesis of a phenolic ester equipped with an - 
unsymmetrical disulfide {cf. 10) was then investigated. The phenolic ester linkage 
would in itself probably not manifest sufficient reactivity to serve as a viable 
acylating agent. However, equipped as it would be wim an ^-disulfide moiety, 
the phenolic ester could operate as an incipient acyl donor. Thus, reduction of the 
disulfide in 12, as shown, might well set the stage for in situ elaboration mto a 
thioester (cf. 13->14) by 0->S acyl migration. Thioester 14, present in an 
unfavorable, but dynamic equilibrium, could perhaps be interdicted by the 
machinery appropriate to native chemical ligation (NCL) 12 < ultimately culminating 
in 8. 

[01091 While the proposal for the intermediacy of a discreet thioester 14 is 

certainly attractive, it is not necessarily an obligatory event en route to ligation. 
Thus as suggested in structure 13, the presence of a free o^o-benzene thiol 
, function might well enhance the acylating facility of the phenolic ester, 17 perhaps 
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through in situ hydrogen bonding or anchimeric assistance. In either case, the 
sequence is set into motion only upon reductive cleavage of the disulfide. 
[0110] The synthesis of the masked glycopeptide acyl donor was 

accomplished as follows (Scheme 8). Commercially available 2-mercaptophenol (9) 
was oxidized to its symmetrical disulfide. Subsequent exchange with excess ethyl 
disulfide 18 furnished an unsymmetrical, aryl-alkyl disulfide. Acylation of the 
phenolic hydroxyl group with Boc-phenylalanine followed by acidic cleavage of the 
Boc protecting group afforded phenylalanine derivative 10. Standard peptide 
coupling of 10 with a fully protected peptide acid 19 followed by global deprotection 
provided ll, 20 which contains a free aspartate residue. Aspartylation of a 
chitobiose-derived glycosylamine with 11 afforded the virtual acyl donor 12, 
corresponding to our target structure 4 (Scheme 7). 

[0111] Scheme 8. A structure, synthesis, and proposed mechanism of a 

masked glycopeptide acyl donor. 3 


IX* 


Ph^ 10 x'/n 

Scheme 1 



intramolecular 
acyl transfer 


pH ~7.4, 
excess 
R-SH 


1^11 


Ph' 14 _ 



(G^=chitobiose in 12-15; (F^M = FmocRNRSG- in 12-15; R = CH 2 CH 2 S0 3 Na 
[0112] fl Reagents and conditions: (a) I 2 , MeOH, H 2 0; (b) BF 3 -OEt 2 , 

EtSSEt, CH 2 C1 2 , 99%, 2 steps; (c) Boc-Phe-OH, EDCI, DMAP, CH 2 C1 2 /THF, 93%; 
(d) 4N HCl/dioxane, 94%; (e) Fmoc-Arg(Pbf)-Asp('Bu)-Arg(Pbf)-Ser('Bu)-Gly-OH, 
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HATU, DIEA, DMF, 61%; (f) TFA:phenol:Et 3 SiH:H 2 0 35:2:1:1, 60%; (g) 
GlcNAcpi->4GlcNAcP-NH 2 , HATU, DffiA, DMSO, 52%. 

[01131 As a preliminary probe of the practicability of this scheme, 

phenylalanine derivative 10 was subjected to the neutral, aqueous, reducing reaction 
conditions described above, along with free cysteine (Scheme 9). The desired free 
dipeptide Phe-Cys (17) was obtained in 78% isolated yield. Only a single product 
was observed by LCMS and >H NMR, strongly suggesting that epimerization at the 
a-carbon of phenylalanine did not occur. 

[0114] Scheme 9. An example of the latent acyl donor function. 

Ptss ^ ^SH excess Q ^sh 

Ph^ 1(J 16 78% yield 17 

[0115] The synthesis of Afunctional glycopeptides was then carried out. 

Thus, 12 was treated with excess sodium 2-mercaptoethanesulfonate (MES-Na) at 
neutr'al P H in aqueous phosphate buffered saline (PBS) in the presence of 18, 21 
containing a latent N-terminal cysteine residue (Scheme 10). As observed by LCMS 
; during the reaction, 12 was almost instantaneously (less than 3 min) converted to the 
thioester derived from MES-Na (c/ 15); very little hydrolysis to the free carboxylic 
acid was noted. Over the course of the next several hours, we observed a gradual 
decrease in the amount of this thioester, with a concomitant increase in the desired 
product 19. Only one compound of the appropriate mass was observed by LCMS 
0 and *H NMR, again indicating a lack of epimerization. 

[0116] Scheme 10. Exemplary convergent coupling of fictionalized 

glycopeptides. 
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80 ^^ 8 ^^-NH ACHN H o°7cHnA 

Fmoc-Arg-Asn-Arg-Ser-Gly-Ns^^.Q'^sf -. -o 

I excess 
71% MES-Na, 
lpH-7.4 

HOT 


12 J teuSS' 

Ph 


AcHN AcHN L H O ACHN x 

F moo.Arg-As r n.Arg.Ser.G.y-N^ N r l - A ' a - Asn - A,a - Ser - NH2 


19 Phe-Cys SH 


[0117] With this proof of principle in hand, attention was directed to 

investigating the generality of the method. For example, applicability of the method 
to incorporation of O-linked domains 22 was established, as shown through the 

5 synthesis of 20. Notably, the histidine-containing system 21 was also assembled 
(for structure of 21 and 20, see Figure 1). Since activated C-terminal histidine 
residues are known to be particularly susceptible to epimerization, 23 a prophylactic 
dinitrophenyl (DNP) protecting group 24 was used for the imidazole x-nitrogen 
during the synthesis. As expected, the DNP group was cleaved concurrently during 

10 the mildly nucleophilic coupling reaction. No indications of epimerization were 
observed by LCMS or *H NMR. 

[01181 Validation of the highly convergent methodology in the context of 

complex glycans was then investigated. Syntheses of glycosylamines 22 13 and 23 25 
(Figure 2) have been reported previously. It will be noted that these compounds 
15 differ in a single stereogenic locus in the interior C-ring (see asterisks). Thus 22 
corresponds to a fully stereochemical^ competent iV-linked high mannose 
oligosaccharide. In contrast, its C-ring C-2 epimer is an unnatural stereochemical 
mutant. Studies aimed at evaluating whether high-mannose binding lectins can 
discriminate between point mutants that differ in only one out of a total of 25 
20 stereochemical loci are underway. 

[0119] Preparation of pentasaccharide-derived glycopeptide coupling 

partners proceeded via Lansbury aspartylation- of 22 and 23 with 25 and 26, which 


Page 46 of 74 


WO 2005/044841 


PCT/US2004/029047 


afforded 27 and 28, respectively. Coupling of 28 with a relatively simple 
chitobiose-containing acyl donor gave 24 (Figure 2) in 77% yield. 
[01201 Scheme 11. Convergent coupling of two functionalized 

glycopeptides containing distinct, structurally complex glycans. a 


Ac-Arg-Asp-Arg-Ser-Gly 


ACHN AcHN I H o|Y 

Ac-Arg-Asn-Arg-Ser-Gly-N^^pA 
27 Ph - : 


H 2 N-Cys-Ala-Asn-Ala-Ser-NH 2 
«BuS*.-S^ ,„ 


Ac-Aro^sn-ArgS^-Gly-Phe-Cys-Ala-AsWh 


[0121] a Reagents and conditions: (a) 25 or 26, HATU, DffiA, DMSO, 50% 

for 27 (from 22), 71% for 28 (from 23); (b) 20% piperidine in DMF, 63%; (c) 0.2 M 
phosphate, 0.2 M NaCl, pH -7.4, excess MES-Na, 75%. 

[0122] Coupling of 27 and 28 was accomplished smoothly upon reductive 

cleavage of the disulfide linkage in 27, affording 29 as shown in Scheme 11. Aside 
from validating the methodology in a striking way, the synthesis of 29 serves to 
pinpoint the power of the total synthesis approach: Since one of its carbohydrates is 
unnatural, 29 cannot readily be obtained from natural sources or via enzymatic 
manipulations. 

[0123] In summary, a convergent method for the synthesis of diglycosylated 

peptides is described herein. The mechanistic rationale set forth in Scheme 8 is 
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supported by the identification of the MES-Na thioester as a reaction intermediate. 
Furthermore, potential hydrolysis and C-terminal epimerization of the glycopeptide 
acyl donor have been suppressed. It seems likely that the method and logic set forth 
above will enjoy application in the building of complex glycopeptides of biological 
and even medicinal consequence. 

[0124] It will be appreciated that the method can readily be applied to the 

preparation of multi-glycosylated peptides (e.g., comprising more than two non- 
adjacent glycosides covalently linked at designated sites). 
[0125] General Rea ction Procedures: 

[0126] Unless mentioned specifically, reaction mixtures were stirred using a 

magnetically driven stirrer bar. Reactions involving air or moisture-sensitive 
reagents or intermediates were performed under argon or nitrogen atmosphere in 
glassware which had been heat gun or flame-dried under high vacuum. An inert 
atmosphere refers to either dry argon or dry nitrogen. Reactions were monitored 
either by thin layer chromatography, by proton nuclear magnetic resonance (NMR) 
or by high-pressure liquid chromatography (HPLC), of a suitably worked up sample 
of the reaction mixture. 

[0127] Indicated reaction temperatures refer to those of the reaction bath, 

while room temperature (rt) is noted as 22 °C. Preparative reactions were stirred 

) magnetically. Tetrahydrofuran (THF), diethyl ether (Et 2 0), methylene chloride 
(CH 2 C1 2 ), and toluene were obtained from a dry solvent system (activated alumina 
columns, positive pressure of argon). All other solvents were used as received in 
Sure/Seal bottles (Aldrich). Triemylarnine (Et 3 N), diisopropylethylamine (/- 
Pr 2 NEt), pyridine, and 2,6-lutidine were distilled from CaH 2 immediately prior to 

5 use. All other reagents were purchased from Aldrich at the highest commercial 
quality and used without further purification. 
[0128] General Wor k TTp Procedures: 

[0129] Unless mentioned specifically, reaction mixtures were cooled to room 

temperature or below then quenched, when necessary, with either water or a 
30 saturated aqueous solution of ammonium chloride. Desired products were extracted 
by partitioning between water and a suitable water-immiscible solvent {e.g. ethyl 
acetate, dichloromethane, diethyl ether). The desired product containing extracts 
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were washed appropriately with water followed by a saturated solution of brine. On 
occasions where the product containing extract was deemed to contain residual 
oxidants, the extract was washed with a 1 0% solution of sodium sulphite in saturated 
aqueous sodium bicarbonate solution, prior to the aforementioned washing 
5 procedure. On occasions where the product containing extract was deemed to 
contain residual acids, the extract was washed with saturated aqueous sodium 
bicarbonate solution, prior to the aforementioned washing procedure (except in those 
cases where the desired product itself had acidic character). On occasions where the 
product containing extract was deemed to contain residual bases, the extract was 
10 washed with 10% aqueous citric acid solution, prior to the aforementioned washing 
procedure (except in those cases where the desired product itself had basic 
character). Post washing, the desired product containing extracts were dried over 
anhydrous magnesium sulphate, and then filtered. The crude products were then 
isolated by removal of solvent(s) by rotary evaporation under reduced pressure, at an 
15 appropriate temperature (generally less than 45°C). 
[0130] General Purification Procedures: 

[0131] Unless mentioned specifically, chromatographic purification refers to 

flash column chromatography on silica, using a single solvent or mixed solvent as 
eluent. Suitably purified desired product containing elutes were combined and 
20 concentrated under reduced pressure at an appropriate temperature (generally less 
than 45°C) to constant mass. Final compounds were dissolved in 50% aqueous 
acetonitrile, filtered and transferred to vials, then freeze-dried under high vacuum 
before submission for biological testing. 
[0132] Analytical Equipment: 

25 [0133] HPLC: All separations involved a mobile phase of 0.05% TFA (v/v) 

in water (solvent A)/0.0425% TFA in acetonitrile (solvent B). Preparative, 
semipreparative, and analytical HPLC separations were performed using a Rainin 
HXPL solvent delivery system equipped with a Rainin UV-1 detector and one of the 
following Dynamax-60A C18 axial compression columns 250 mm in length 
30 equipped with a similarly packed guard column: 41.4 mm diameter (prep), 21.4 m 
diameter (semiprep), or 4.6 mm diameter (analytical). Separations were performed 
at flow rates of 48 mL/min (prep), 16 mL/min (semiprep), or 1 mL/min (analytical), 
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and were monitored at a wavelength between 214 and 230 nm, depending on column 

loading. LCMS chromatographic separations were performed using a Waters 2695 

Separations Module and a Waters 996 Photodiode Array Detector equipped with a 

Varian Microsorb C18 2 x 150 mm column at a flow rate of 0.2 mL/min. 

[0134] ESMS and LCMS: Electrospray mass spectroscopy and LCMS 

analyses were obtained on a Waters Micromass ZQ mass spectrometer in 

conjunction with the Waters HPLC apparatus described above. 

[0135] NMR: ! H and 13 C NMR spectra were recorded on Bruker 

instruments in CDCI3, CD3OD or D 2 0 at 400 or 500 MHz for >H and 100 or 125 

MHzfor 13 C. i 
[0136] Reagents: All commercial materials were used as received unless 

otherwise noted. The following solvents were obtained from a dry solvent system 
and used without further purification: THF, diethyl ether, toluene, and DCM. 
Reagents were obtained from Aldrich or as noted, with the following exceptions: 
amino acids and resins for solid phase peptide synthesis were purchased from 
NovaBiochem; Biosynthesis grade DMF from EM Science; and all other solvents 
from Fisher Scientific (HPLC grade). 

oh [0137] To a stirred, biphasic solution of 2-mercaptophenol 

QvS^i (1.0 g, 7.9 mmol) in H 2 0 (5.1 mL) was added, drop wise; a solution 
Ho^T of iodine (1.0 g 4.0 mmol) in methanol (3.5 mL). When the brown 
iodine color persisted the solution was diluted with ethyl acetate and water. The 
aqueous layer was removed and extracted with an additional portion of ethyl acetate. 
The combined organic layers were dried and washed with brine then dried (Na 2 S0 4 ) 
and concentrated to give a brown oil which was used without purification (1.5 g). 
,5 The product still contains iodine. >H NMR (CDCI3, 500 MHz): 5 7.33-7.37 (m, 2H), 
7 22-7.24 (m, 2H), 6.99-7.01 (m, 2H), 6.82-6.85 (m, 2H), 6.22 (brs, 2H). l H NMR 
(CDCI3, 400 MHz): 8 7.35 (dt, J- 8.16, 1.65 Hz, 2H), 7.22 (dd, J= 7.52, 1.65 Hz, 
2H), 7.00 (dd, J= 8.16, 1.15 Hz, 2H), 6.83 (dt, J= 7.52, 1.15 Hz, 2H), 6.22 (s, 2H) 
«C NMR (CDCI3, 125 MHz): 8 157.3, 136.6, 133.6, 121.4, 120.3, 116.1. ESI-MS: 
30 Calcd. for C 12 H 10 O 2 S 2 IM+NH*]* 267.8 Found: 267.8. 

[0138] To a stirred solution of the disulfide (1.5 g crude, ~4 

OC°"s^ mmol) in CH 2 C1 2 (25 mL) was added ethyldisulfide (10.5 mL, 79.9 
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mmol) and then BF 3 .OEt 2 (10.1 mL, 79.9 mmol). The reaction was stirred at room 
temperature for three hours and then carefully quenched by the addition of (aq.) 
NaHC0 3 . The organic layer was drained and the aqueous layer was extracted with 
an additional portion of CH 2 C1 2 . The combined organic layers were dried (MgS0 4 ) 
and concentrated to give a yellow oil. Purification by silica gel chromatography 
(20% ethyl acetate in hexane) gave the desired product as a clear, slightly yellow oil 
(1 45 g 99%). l H NMR (CDC1 3 , 500 MHz): 6 7.48-7.51 (m, 1H), 7.28-7.32 (m, 
1H), 6.99-7.01 (m, 1H), 6.86-6.9 (m, 1H), 6.34 (fan, 1H), 2.78 (q, J= 7.4 Hz, 2H), 
1 35 (t J- 7.4 Hz, 3 H). 13 C NMR (CDC1 3 , 125 MHz): 8 156.9, 135.2, 132.2, 
121.0, 116.2, 32.4, 14.1. ESI-MS: Calcd. for C 8 H 10 OS 2 [M+Naf 208.8 Found: 
208.8. 

[0139] To a solution of the phenol (1.45 g, 8 mmol) and 

Boc-Phe-OH (2.65 g, 10 mmol) in CH 2 C1 2 (25 mL) and THF (5 
added EDCI 


BocHN 



mL) was 


V (1.92 g, 10 mmol) and DMAP (98 mg, 0.8 mmol). The resulting 
solution was stirred at room temperature for 18 hr at which point the volatile 
materials were removed in vacuo. The resulting oil was taken up in EtOAc and 
washed with IN HC1, H 2 0, and then brine. The organic layer was dried (Na 2 S0 4 ) 
and concentrated to give a slightly yellow oil. Purification by silica gel 
20 chromatography (30% ethyl acetate in hexane) gave a clear, colorless oil (3.5 g, 
> 99 o/ 0 ) i H NMR (CDCI3, 400 MHz): 8 7.78-7.81 (m 1H), 7.23-7.36 (m, 7H), 7.01- 
7 03 (m, 1H), 5.02 (d, J = 8.16 Hz, 1H), 4.86-4.91 (m, 1H), 3.31 (abx, 
J ab = 13.98 Hz, J m = 5.41 Hz, J bx = 7.22 Hz, 2H), 2.72 (q, J= 7.33 Hz, 2H), 1.42 (s, 
9H), 1.29 (t, J= 7.33 Hz, 3H). ESI-MS: Calcd. for C 22 H 27 N0 4 S 2 [M+Naf 456.0 
25 Found: 456.0. 

^ [0140] To a 250 mL round-bottomed flask equipped with a 

magnetic stir bar was added the above phenylalanine derivative (3.0 

H2 o °DO g' 69 mmol) aS Wdl aS a 4M SOlUti ° n ° f * di0XanC 

io°I E t 3 o" 345 mmol) and the reaction was stirred under argon at room 

temperature for 1.5 hr. At that point the reaction was concentrated in vacuo, leaving 

a white solid. This material was triturated with ether and subsequently concentrated. 
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This process was repeated (3x) leaving the crude product as a white solid. The 
material was dissolved in 30% B, shell frozen and lyophilized to give a white 
powder (2.17 g, 94%). l H NMR (CD3OD, 400 MHz): 8 7.93-7.95 (m, 1H), 7.52- 
7 42 (m, 7H), 7.23-7.21 (m,lH), 4.81 (dd, J- 8.35, 5.73 Hz, 1H), 3.56 (abx, J ab = 
1448 Hz,J flX =5.73Hz,^=8.35Hz, 2^2.85(^^=7.34,2^,1.37(^=7.34, 

3H) 13 C NMR (CDCI3, 125 MHz): 5 169.0, 149.4, 135.6, 131.9, 131.3, 131.0, 
130.7, 129.9, 129.6, 129.1, 123.9, 55.6, 38.0, 34.1, 14.9. ESI-MS: Calcd. for 
Ci 7 H 19 N0 2 S2 [M+H] + 334.1 Found: 334.2. 

^ [01411 The phenylalanine derivative 10 (15 mg, 45 nmol) 

X 3 t ^ L - c y steine < 6 mg ' 49 ^ mol) were placed a LCMS ^ 

wYt 1 * along with a flea-sized stirbar. In a second vial were mixed MES- 
Na (25 mg! H 150 junol) and phosphate buffered saline (PBS) (0.2M NaCl, 0.2M 
phosphate, P H = 7.5, 2 mL). The MES-Na solution was then added directly to the 
amino acids, and the reaction was monitored by LCMS. After two hours the 
15 reaction appeared to be complete and TCEP (129 mg, 450 ^mol) was added. Tins 
was stirred for 1 hour and then injected directly onto the HPLC for purification. The 
desired compound was obtained as a white powder after lyophilization (9.4 mg, 
78o/o). *H NMR (D2O): 5 7.36-7.45 (m, 3H), 7.31-7.33 (m, 2H), 4.63 (dd, J- 6.61, 
4 83 Hz, 1H), 4.34 (dd, J= 7.24, 7.18 Hz, 1H), 3.25 (abx, J ab = 14.1 Hz, J„ - 7.18 
20 Hz, J bx = 7.24 Hz, 2H), 2.96 (abx, J ab = 14.2 Hz, J„ = 4.83 Hz, J hx = 6.61 Hz, 2H). 
LCMS: 5-65%B over 20 min, rt = 8.53 min. HPLC: 5-95%B over 30 mm, rt - 
11.45 min. ESI-MS: Calcd. for C 12 H 16 N 2 0 3 S [M+H] + 269.1, Found: 269.1 
[0142] Exemplary ligation conditions: 

[0143] The two glycopeptide halves (12, 2.2 mg, 1.44 ^mol) (18, 1.4 mg, 

25 1 44 umol) were placed in a LCMS vial along with a flea-sized stirbar. A stock 
solution of MESNa (18.3 mg, 111 mmol) in phosphate buffered saline (0.2M NaCl, 
0.2M phosphate, pH = 7.4, 1 mL) was made and of this, 600 uL was added to the 
glycopeptides. The reaction was monitored by LCMS and, once finished, TCEP (25 
mg, 0.087 mmol) was added and the solution stirred for 2 hr then injected directly 
30 onto the HPLC for purification. 
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AcHN HO-^TX= .OH >^ O , H O , H O 

HN^ HhT 
HN^NH 2 HN^NHz 

101441 LCMS: 5-65%B over 20 min, rt = 9.98 min. HPLC: 25-55%B over 

1 min, 
30 

5 rt = 7.65 min. ESI-MS: Calcd. for C 93 H 13 sN 24 0 37 S [M + 2H] 2+ 1108.5, Found: 
1108.6, [M+3H] 3+ 739.3, Found: 739.5. 

HO° H 


HO^ f> 


„7 ^ 


HO — v ^ st t r 

Sh 2 HN^NH 2 20 

roi451 LCMS: 5-65%B over 20 min, rt = 11.40 min. HPLC: 5-6 5 o/oB over 

1 min, 

rtl 9.85 min. ESI-MS: Calcd. for C, 3 H, 2 oN 22 0 3 „S [M + 2Hf 909.4, Found: 909.5, 


[M+3H] 3+ 606.6, Found: 606.8 
ho^ ho^ 

"iesfri&&r ... o 


, ^oA>^-NH i nh 


HN^NH 2 HiAnH 2 


15 [0146] LCMS:5-65o/oBover20min,rt = 4.02min. HPLC: 5-65%B over 20 

nun, rt - 6.71 min. ESI-MS: Calcd. for C 73 H 120 N 22 O 30 S [M + 2H] 2+ 898.4, Found: 


898.6. 
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AcHN 


[0147] LCMS:5-65%Bover20rnin,rt=7.81rnin. HPLC: 5-65%B over 20 

Bta. rt - 8.39 min. ESI-MS: Calcd. for CrffaMAnS [M + 2H) 2+ 1261.5, Found: 
1261.5, [M+3H] 3 * 841.4, Found: 841.5. 


HO >OH 


f = " 6 f HO ° H U ^SH - I 

HN NH 2 HN^NH 2 H0 ^^R H °^Q H °T\. V° 


HO^ OH 


[ 0 148] LCMS: 5-45%B over 20 min, rt = 10.82 min. HPLC: 5-45%B over 

min, 

20 

rt = 9.31 min. ESI-MS: Calcd. for C 98 H 160 N 24 O 51 S [M + 2H] 2+ 1504.6, Found: 
1504.9, [M+3H] 3+ 1003.4, Found: 1003.7. 

ABBREVIATIONS AND GLOSSARY 


[0149] 

A: alanine 

[0150] 

Ac: acetyl 

[0151] 

Ala: alanine 

15 [0152] 

Arg: arginine 

[0153] 

Asn: asparagine 

[0154] 

Asp: aspartic acid 

[0155] 

Bn: benzyl 

[0156] 

Boc: ter/-butyloxycarbonyl 
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[0157] Bu: butyl 

[0158] Bz: benzoyl 


[0159] 
[0160] 


CAN: eerie ammonium nitrate 
C-terminus: peptide carbonyl terminus 

5 [0161] Cys: cysteine 

[0162] D: aspartic acid 

[0163] DIEA: iV^-diisopropylethylamine 

[0164] DMAP: AT^-Dimethylaminopyridine 

[0165] DMF ; dimethyl formamide 

10 [0166] DMSO: dimethyl sulfoxide 

[0167] DTBMP: di-ferr-butylmethylpyridine 

[0168] DTBP: di-terf-butylpyridine 

[0169] Et: ethyl 

[0170] Fmoc: 9-fluorenylmethyloxycarbonyl 

15 [0171] G: glycine 

[0172] Gal: galactose 

[0173] Glc: glucose 

[0174] Gin: glutamine 

[01751 Glu: glutamic acid 

20 [0176] Gly: glycine 

10177] H: histidine 

[0178] HATU: 7-azahydroxybenzotriazolyl tetramefhylurommn 
hexafluorophosphate 


[0179] 

His: histidine 

25 [0180] 

He: isoleucine 

[0181] 

K: lysine 

[0182] 

kDa: kilodaltons 

[0183] 

KLH: keyhole limpet hemocyanin 

[0184] 

L: leucine 

30 [0185] 

Leu: leucine 

[0186] 

Lys: lysine 

[0187] 

Man: mannose 
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[0188] MES-Na: 2-mercaptoethanesulfonic acid, sodium i 

[0189] MHC: major histocompatibility complex 

[0190] N: asparagine 

[0191] NAc: AT-acetyl 

5 [0192] NCL: native chemical ligation 

[0193] N-terminus: peptide amine terminus 

[0194] O-linked: linked through an ethereal oxygen 

[0195] Pam3Cys: tripalmitoyl-S-glycerylcysteinylserine 

[0196] PBS: phosphate-buffered saline 

10 [0197] Ph: phenyl 

[0198] PMB-.p-methoxybenzyl 

[0199] Pro- proline 

[0200] GP120: prostate specific antigen 

[0201] Py: pyridine 

15 [0202] QS21: a glycosteroidal immunoadjuvant 

[0203] R: arginine 

[0204] S: serine 

[0205] sat. aq.: saturated aqueous 

[0206] Ser: serine 

20 [0207] T: threonine 

[0208] TBAF: : tetra-«-butylammonium fluoride 

[0209] TBS: te^butyldimethylsilyl 

[0210] 'Bu: tert-butyl 

[0211] Tf: trifluoromethanesulfonate 

25 [0212] TFA: Trifluoroacetic acid 

[0213] THF: tetrahydrofuran 

[0214] Thr: threonine 

[0215] t-GP120: total prostate specific antigen 

[0216] Trp: tryptophan 

30 [0217] V: valine 

[0218] Val: valine 

[0219] W: tryptophan 
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